CHIMERIC GENES AND METHODS FOR 
INCREASING THE LYSINE AND THREONINE 
CONTENT OF THE SEEDS OF PLANTS 
This is a continuation-in-part of Serial No. 08/178,212, filed on 1/6/94 which 
was the national filing of PCT/US93/02480, filed on 3/18/93 and which is a 
continuation-in-part of Serial No. 07/855,414, filed on 3/19/92. 

TECHNICAL FIELD 
This invention relates to four chimeric genes, a first encoding lysine- 
insensitive aspartokinase (AK), which is operably linked to a plant chloroplast 
transit sequence, a second encoding lysine-insensitive dihydrodipicolinic acid 
synthase (DHDPS), which is operably linked to a plant chloroplast transit 
sequence, a third encoding a lysine-rich protein, and a fourth encoding a plant 
lysine ketoglutarate reductase, all operably linked to plant seed-specific regulatory 
sequences. Methods for their use to produce increased levels of lysine or threonine 
in the seeds of transformed plants are provided. Also provided are transformed 
plants wherein the seeds accumulate lysine or threonine to higher levels than 
untransformed plants. 

BACKGROUND OF THE INVENTION 
Human food and animal feed derived from many grains are deficient in some 
of the ten essential amino acids which are required in the animal diet. In com (Zea 
mays L.), lysine is the most limiting amino acid for the dietary requirements of 
many animals. Soybean {Glycine max L.) meal is used as an additive to com based 
animal feeds primarily as a lysine supplement. Thus an increase in the lysine 
content of either com or soybean would reduce or eliminate the need to 
supplement mixed grain feeds with lysine produced via fermentation of microbes. 

Plant breeders have long been interested in using naturally occuring 
variations to improve protein quality and quantity in crop plants. Maize lines 
containing higher than normal levels of lysine (70%) have been identified [Mertz et 
al. (1964) Science 145:279, Mertz et al. (1965) Science 750:1469-70]. However, 
these lines which incorporate a mutant gene, opaque-2, exhibit poor agronomic 
qualities (increased susceptibility to disease and pests, 8-14% reduction in yield, 
low kernel weight, slower drying, lower dry milling yield of flaking grits, and 
increased storage problems) and thus are not commercially useful [Deutscher 
(1978) Adv. Exp. Medicine and Biology 105:281-300]. Quality Protein Maize 
(QPM) bred at CIMMYT using the opaque-2 and sugary-2 genes and associated 
modifiers has a hard endosperm and enriched levels of lysine and tryptophan in the 
kernels [Vasal, S. K., et al. Proceedings of the 3rd seed protein symposium f 



Gatersleben, August 31 - September 2, 1983]. However, the gene pools 
represented in the QPM lines are tropical and subtropical. Quality Protein Maize is 
a genetically complex trait and the existing lines are not easily adapted to the dent 
germplasm in use in the United States, preventing the adoption of QPM by corn 
breeders. 

The amino acid content of seeds is determined primarily (90-99%) by the 
amino acid composition of the proteins in the seed and to a lesser extent (1-10%) 
by the free amino acid pools. The quantity of total protein in seeds varies from 
about 10% of the dry weight in cereals to 20-40% of the dry weight of legumes. 
Much of the protein-bound amino acids is contained in the seed storage proteins 
which are synthesized during seed development and which serve as a major 
nutrient reserve following germination. In many seeds the storage proteins account 
for 50% or more of the total protein. 

To improve the amino acid composition of seeds genetic engineering 
technology is being used to isolate, and express genes for storage proteins in 
transgenic plants. For example, a gene from Brazil nut for a seed 2S albumin 
composed of 26% sulfur-containing amino acids has been isolated [Altenbach et al. 
(1987) Plant MoL Biol. 5:239-250] and expressed in the seeds of transformed 
tobacco under the control of the regulatory sequences from a bean phaseolin 
storage protein gene. The accumulation of the sulfur-rich protein in the tobacco 
seeds resulted in an up to 30% increase in the level of methionine in the seeds 
[Altenbach et al. (1989) Plant MoL Biol ii:513-522]. However, no plant seed 
storage proteins similarly enriched in lysine relative to average lysine content of 
plant proteins have been identified to date, preventing this approach from being 
used to increase lysine. 

An alternative approach is to increase the production and accumulation of 
specific free amino acids such as lysine via genetic engineering technology. 
However, little guidance is available on the control of the biosynthesis and 
metabolism of lysine in the seeds of plants. 

Lysine, along with threonine, methionine and isoleucine, are amino acids 
derived from aspartate, and regulation of the biosynthesis of each member of this 
family is interconnected. Regulation of the metabolic flow in the pathway appears 
to be primarily via end products. The first step in the pathway is the 
phosphorylation of aspartate by the enzyme aspartokinase (AK), and this enzyme 
has been found to be an important target for regulation in many organisms. 
However, detailed physiological studies on the flux of 4-carbon molecules through 
the aspartate pathway have been carried out in the model plant system Lemna 
paucicostata [Giovanelli et al. (1989) Plant Physiol. 9(7:1584-1599]. The authors 



state "These data now provide definitive evidence that the step catalyzed by 
aspartokinase is not normally an important site for regulation of the entry of 
4-carbon units into the aspartate family of amino acids [in plants]." 

The aspartate family pathway is also believed to be regulated at the branch- 
point reactions. For lysine this is the condensation of aspartyl {3-semialdehyde with 
pyruvate catalyzed by dihydrodipicolinic acid synthase (DHDPS), while for 
threonine and methionine the reduction of aspartyl |3-semialdehyde by homoserine 
dehydrogenase (HDH) followed by the phosphorylation of homoserine by 
homoserine kinase (HK) are important points of control. 

The E. coli dapA gene encodes a DHDPS enzyme that is about 20-fold less 
sensitive to inhibition by lysine than than a typical plant DHDPS enzyme, e.g., 
wheat germ DHDPS. The E. coli dap A gene has been linked to the 35S promoter 
of Cauliflower Mosaic Virus and a plant chloroplast transit sequence. The 
chimeric gene was introduced into tobacco cells via transformation and shown to 
cause a substantial increase in free lysine levels in leaves [Glassman et aL (1989) 
PCT Patent Appl. PCT/US89/01309, Shaul et al. (1992) Plant Jour. 2:203-209, 
Galili et al. (1992) EPO Patent Appl. 91 1 19328.2]. However, the lysine content of 
the seeds was not increased in any of the transformed plants described in these 
studies. The same chimeric gene was also introduced into potato cells and lead to 
small increases in free lysine in leaves, roots and tubers of regenerated plants [Galili 
et al. (1992) EPO Patent Appl. 91 1 19328.2, Perl et al. (1992) Plant MoL Biol. 
79:815-823]. These workers have also reported on the introduction of an £. coli 
lysC gene that encodes a lysine-insensitive AK enzyme into tobacco cells via 
transformation [Galili et al. (1992) Eur. Patent Appl. 91119328.2; Shaul et al. 
(1992) Plant Physiol. 100:1 157-1 163]. Expression of the E. coli enzyme results in 
increases in the levels of free threonine in the leaves and seeds of transformed 
plants. Crosses of plants expressing E. coli DHDPS and AK resulted in progeny 
that accumulated more free lysine in leaves than the parental DHDPS plant, but 
less free threonine in leaves than the parental AK plant. No evidence for increased 
levels of free lysine in seeds was presented. 

The limited understanding of the details of the regulation of the biosynthetic 
pathway in plants makes the application of genetic engineering technology, 
particularly to seeds, uncertain. There is little information available on the source 
of the aspartate-derived amino acids in seeds. It is not known, for example, 
whether they are synthesized in seeds, or transported to the seeds from leaves, or 
both, from most plants. In addition, free amino acids make up only a small fraction 
of the total amino acid content of seeds. Therefore, over-accumulation of free 
amino acids must be many-fold in order to significantly affect the total amino acid 



composition of the seeds. Furthermore, little is known about catabolism of free 
amino acids in seeds. Catabolism of free lysine has been observed in developing 
endosperm of com and barley. The first step in the catabolism of lysine is believed 
to be catalyzed by lysine-ketoglutarate reductase [Brochetto-Braga et al. (1992) 
Plant Physiol. 98: 1139-1 147]. Whether such catabolic pathways are widespread 
in plants and whether they affect the level of accumulation of free amino acids is 
unknown. Finally, the effects of over-accumulation of a free amino acid such as 
lysine or threonine on seed development and viability is not known. 

Before this patent application no method to increase the level of lysine or 
threonine, or any other amino acid, in seeds via genetic engineering was known. 
Furthermore, no examples of seeds having increased lysine or threonine levels 
obtained via genetic engineering were known before the invention described herein. 
Thus, there is a need for genes, chimeric genes, and methods for expressing them 
in seeds so that an over-accumulation of free amino acids in seeds will result in an 
improvement in nutritional quality. 

SUMMARY OF THE INVENTION 

This invention concerns an isolated nucleic acid fragment comprising: 

(a) a first nucleic acid subfragment encoding apartokinase which is 
substantially insensitive to inhibition by lysine; and 

(b) a second nucleic acid subfragment encoding dihydrcniipicolinic 
acid synthase which is substantially insensitive to inhibition by lysine. 

The invention also concerns an isolated nucleic acid fragment comprising a 
nucleic acid subfragment encoding lysine ketoglutarate reductase. 
Further disclosed herein is a nucleic acid fragment comprising 

(a) a first chimeric gene wherein a nucleic acid fragment encoding 
aspartokinase which is substantially insensitive to inhibition by lysine is operably 
linked to a plant chloroplast transit sequence and to a seed-specific regulatory 
sequence; and 

(b) a second chimeric gene wherein a nucleic acid fragment encoding 
dihydrodipicolinic acid synthase which is substantially insensitive to inhibition by 
lysine is operably linked to a plant chloroplast transit sequence and to a seed- 
specific regulatory sequence. 

Additionally disclosed is an isolated nucleic acid fragment comprising: 

(a) a first chimeric gene wherein a nucleic acid fragment comprising a 
nucleotide sequence essentially similar to the sequence shown in SEQ ID NO:l: 
encoding E, coli ARTE, said nucleic acid fragment encoding a lysine-insensitive 
variant of E. coli AKUI and further characterized in that at least one of the 
following conditions is met: 



(1 ) the amino acid at position 3 18 is an amino acid other than 
threonine, or 

(2) the amino acid at position 352 is an amino acid other than 
methionine is operabiy linked to a plant chloroplast transit 

5 sequence and to a seed-specific regulatory sequence and 

(b) a second chimeric gene wherein a nucleic acid fragment derived 
from a bacteria encoding dihydrodipicolinic acid synthase is operabiy linked to a 
plant chloroplast transit sequence and to a seed-specific regulatory sequence; and 

(c) a third chimeric gene wherein a nucleic acid fragment encoding 
10 part or all of lysine ketoglutarate reductase is operabiy linked in the sense or 

antisense orientation to a seed-specific regulatory sequence. 

Also disclosed is an isolated nucleic acid fragment comprising at least one 
nucleotide sequence essentially similar to the sequence shown in SEQ ID NO:l 
encoding £. coli AKHI, said nucleic acid fragment encoding a lysine-insensitive 
15 variant of E. coli AKHI and further characterized in that at least one of the 
following conditions is met: 

(a) the amino acid at position 318 is an amino acid other than 
threonine, or 

(b) the amino acid at position 352 is an amino acid other than 
20 methionine. 

Also claimed is an embodiment wherein the nucleic acid fragment discussed 
herein is operabiy linked to a plant chloroplast transit sequence and to a seed- 
specific regulatory sequence. 

Plants and seeds comprising in their genomes the described nucleic acid 
25 fragments and/or genes are also disclosed. 

The invention also concerns a method for increasing the threonine content of 
the seeds of plants, and plants produced by such method wherein the plant is 
capable of transmitting said chimeric gene to a progeny plant and wherein the 
progeny plant has the ability to produce levels of free threonine at least two times 
30 greater than the free threonine levels of untransformed plants, 
which method comprises: 

(a) transforming plant cells with the above described chimeric gene; 

(b) growing fertile mature plants from the transformed plant cells 
obtained from step (a) under conditions suitable to obtain seeds; and 

35 (c) selecting from the progeny seed of step (b) for those seeds 

containing increased levels of threonine. 

Also described is a method for increasing the lysine content of the seeds of 
plants and plants produced by such methods wherein the plant is capable of 
transmitting said nucleic acid fragment to a progeny plant and wherein the progeny 



ru 
m 

s 

ry 



plant has the ability to produce levels of free lysine at least two times greater than 
free lysine levels of plants not containing the nucleic acid fragment, which method 
comprises: 

(a) transforming plant cells with the above described nucleic acid 

fragments; 

(b) growing fertile mature plants from the transformed plant cells 
obtained from step (a) under conditions suitable to obtain seeds; and 

(c) selecting from the progeny seed of step (b) those seeds containing 
increased levels of lysine. 

Further disclosed is an isolated nucleic acid fragment comprising: 

(a) a first nucleic acid subfragment encoding apartokinase which is 
substantially insensitive to inhibition by lysine; and 

(b) a second nucleic acid subfragment encoding dihydrodipicolinic 
acid synthase which is substantially insensitive to inhibition by lysine; and 

(c) a third nucleic acid subfragment encoding a lysine-rich protein 
wherein the weight percent lysine is at least 15%. 

Also disclosed herein are an isolated nucleic acid fragment comprising: 

(a) a first chimeric gene wherein a nucleic acid fragment comprising a 
nucleotide sequence essentially similar to the sequence shown in SEQ ID NO:l: 
encoding E. coli AKHI, said nucleic acid fragment encoding a lysine-insensitive 
variant of E. coli AKHI and further characterized in that at least one of the 
following conditions is met: 

( 1 ) the amino acid at position 3 1 8 is an amino acid other than 
threonine, or 

(2) the amino acid at position 352 is an amino acid other than 
methionine is operably linked to a plant chloroplast transit 
sequence and to a seed-specific regulatory sequence and 

(b) a second chimeric gene wherein a nucleic acid fragment derived 
from a bacteria encoding dihydrodipicolinic acid synthase is operably linked to a 
plant chloroplast transit sequence and to a seed-specific regulatory sequence; and 

(c) a third chimeric gene wherein a nucleic acid fragment encoding a 
lysine-rich protein wherein the weight percent lysine is at least 15% is operably 
linked to a seed-specific regulatory sequence. 

Further disclosed herein is an isolated nucleic acid fragment, and plants and 
seeds containing such fragment, comprising: 

(a) a first chimeric gene wherein a nucleic acid fragment comprising a 
nucleotide sequence essentially similar to the sequence shown in SEQ ID NO:l: 
encoding E. coli AKHI, said nucleic acid fragment encoding a lysine-insensitive 



variant of E. coli AKm and further characterized in that at least one of the 
following conditions is met: 

(1) the amino acid at position 318 is an amino acid other than 
threonine, or 

(2) the amino acid at position 352 is an amino acid other than 
methionine is operably linked to a plant chloroplast transit 
sequence and to a seed-specific regulatory sequence and 

(b) a second chimeric gene wherein a nucleic acid fragment derived 
from a bacteria encoding dihydrodipicolinic acid synthase is operably linked to a 
plant chloroplast transit sequence and to a seed-specific regulatory sequence; and 

(c) a third chimeric gene wherein a nucleic acid fragment encoding a 
lysine-rich protein comprising n heptad units (d e f g a b c), each heptad being 
either the same or different, wherein: 

n is at least 4; 

a and d are independendy selected from the group consisting of Met, Leu, 
Val, He and Thr; 

e and g are independently selected from the group consisting of the acid/base 
pairs Glu/Lys, Lys/Glu, Arg/Glu, Arg/Asp, Lys/Asp, Glu/Arg, Asp/Arg 
and Asp/Lys; and 

b, c and f are independently any amino acids except Gly or Pro and at least 
two amino acids of b, c and f in each heptad are selected from the 
group consisting of Glu, Lys, Asp, Arg, His, Thr, Ser, Asn, Ala, Gin 
and Cys, 

said nucleic acid fragment is operably linked to a seed-specific regulatory sequence. 

Further disclosed herein is an isolated nucleic acid fragment, and plants and 
seeds containing such fragment, comprising: 

(a) a first chimeric gene wherein a nucleic acid fragment comprising a 
nucleotide sequence essentially similar to the sequence shown in SEQ ID NO:l: 
encoding E. coli AKHI, said nucleic acid fragment encoding a lysine-insensitive 
variant of E. coli AKHI and further characterized in that at least one of the 
following conditions is met: 

(1) the amino acid at position 3 18 is an amino acid other than 
threonine, or 

(2) the amino acid at position 352 is an amino acid other than 
methionine is operably linked to a plant chloroplast transit 
sequence and to a seed-specific regulatory sequence; and 

(b) a second chimeric gene wherein a nucleic acid fragment derived 
from a bacteria encoding dihydrodipicolinic acid synthase is operably linked to a 
plant chloroplast transit sequence and to a seed-specific regulatory sequence; and 



(c) a third chimeric gene wherein a nucleic acid fragment encoding a 
lysine-rich protein having the amino acid sequence (MEEK1JCA)6(MEEKMKA)2 
is operably linked to a seed-specific regulatory sequence. 

Also disclosed are plants comprising in their genome the nucleic acid 
5 fragments listed above, plants comprising in their genomes each of the chimeric 
genes discribed above and seeds obtained from such plants. 

Also disclosed is a method for increasing the lysine content of the seeds of 
plants comprising: 

(a) transforming plant cells with the nucleic acid fragment listed 

10 above; 

(b) growing fertile mature plants from the transformed plant cells 
obtained from step (a) under conditions suitable to obtain seeds; and 

(c) selecting from the progeny seed of step (b) those seeds containing 
increased levels of lysine. 

15 The invention also includes a novel transformed plant, preferably a rapeseed 

or soybean plant, wherein the seeds of the plant accumulate lysine at a level at least 
ten percent higher than do seeds of an untransfoimed plant (10% to 400% higher 
for soybean and 10% to 100% higher for rapeseed) than do seeds of an 
untransformed plant. 

20 Further disclosed herein is a nucleic acid acid fragment wherein the seed- 

specific regulatory sequence is a monocot embyro-specific promoter, a monocot 
plant comprising in its genome such nucleic acid fragment and a seed obtained 
from that plant and comprising in its genome that nucleic acid fragment. 

Disclosed is a method for increasing the lysine content of the seeds of 
monocot plants comprising: 

(a) transforming plant cells with the nucleic acid fragment of 

Claim 33; 

(b) growing fertile mature plants from the transformed plant cells 
obtained from step (a) under conditions suitable to obtain seeds; and 

(c) selecting from the progeny seed of step (b) those seeds containing 
increased levels of lysine, and plants produced by such a method, wherein the 
plant is capable of transmitting said nucleic acid fragment to a progeny plant and 
wherein the progeny plant has the ability to produce levels of free lysine at least 
five times greater than free lysine levels of plants not containing the nucleic acid 
fragment. 

Also disclosed is a transformed com plant wherein the seeds of the plant 
accumulate lysine to a level between ten percent and one hundred thirty percent 
higher than do seeds of an untransformed plant and a method for increasing the 
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lysine content and reducing the accumulation of lysine breakdown products of the 
seeds of plants. 

Further disclosed is a method for increasing the lysine content and reducing 
the accumulation of lysine breakdown products of the seeds of plants comprising: 

(a) transforming plant cells with the nucleic acid fragment described 
herein, then 

(b) growing fertile mature plants from the transformed plant cells 
obtained from step (a) under conditions suitable to obtain seeds; 

(c) selecting from the progeny seed of step (b) those seeds containing 
increased levels of lysine; and lysine breakdown products and 

(d) introducing a mutation in the gene encoding lysine ketoglutarate 
reductase which reduces the enzyme activity and reduces accumulation of lysine 
breakdown products. 

BRIEF DESCRIPTION OF THE 
DRAWINGS AND SEQUENCE DESCRIPTIONS 

The invention can be more fully understood from the following detailed 
description and the accompanying drawings and the sequence descriptions which 
form a part of this application. 

Figure 1 shows an alpha helix from the side and top views. 

Figure 2 shows end (Figure 2a) and side (Figure 2b) views of an alpha helical 
coiled-coil structure. 

Figure 3 shows the chemical structure of leucine and methionine emphasizing 
their similar shapes. 

Figure 4 shows a schematic representation of gene expression cassettes. 

Figure 5 shows a map of the binary plasmid vector pZS97K. 

Figure 6 shows a map of the binary plasmid vector pZS97. 

Figure 7A shows a map of the binary plasmid vector pZS199; Figure 7B 
shows a map of the binary plasmid vector pFS926; Figure 7C shows a map of the 
binary plasmid vector pBT593; Figure 7D shows a map of the binary plasmid 
vector pBT597. 

Figure 8A shows a map of the plasmid vector pBT603; Figure 8B shows a 
map of the plasmid vector pBT614. 

Figure 9 shows the amino acid sequence similarity between the polypeptides 
encoded by two plant cDNAs and fungal SDH (glutamate-forming). 

Figure 10 depicts the strategy for creating a vector (pSK5) for use in 
construction and expression of the SSP gene sequences. 

Figure 1 1 shows the strategy for inserting oligonucleotide sequences into the 
unique Ear I site of the base gene sequence. 



Figure 12 shows the insertion of the base gene oligonucleotides into the 
Nco I/EcoR I sites of pSK5 to create the plasmid pSK6. This base gene sequence 
was used as in Figure 8 to insert the various SSP coding regions at the unique 
Ear I site to create the cloned seqments listed. 

Figure 13 shows the insertion of the 63 bp "segment" oligonucleotides used 
to create non-repetitive gene sequences for use in the duplication scheme in 
Figure 12. 

Figure 14 (A and B) shows the strategy for multiplying non-repetitive gene 
"segments" utilizing in-frame fusions. 

Figure 15 shows the vectors containing seed specific promoter and 3* 
sequence cassettes. SSP sequences were inserted into these vectors using the 
Nco I and Asp718 sites. 

Figure 16 shows a map of the plasmid vector pML63. 

Figure 17 shows a map of the plasmid vector pBT680. 

Figure 1 8 shows a map of the plasmid vector pBT68 1 . 

Figure 19 shows a map of the plasmid vector pLH104. 

Figure 20 shows a map of the plasmid vector pLH105. 

Figure 21 shows a map of the plasmid vector pBT739. 

Figure 22 shows a map of the plasmid vector pBT756. 

SEQ ID NO:l shows the nucleotide and amino acid sequence of the coding 
region of the wild type E. coli lysC gene, which encodes AKIII, described in 
Example 1. 

SEQ ID NOS:2 and 3 were used in Example 2 to create an Nco I site at the 
translation start codon of the E. col i lysC gene. 

SEQ ID NOS:4 and 5 were used in Example 3 as PCR primers for the 
isolation of the Corynebacterium dapA gene. 

SEQ ID NO:6 shows the nucleotide and amino acid sequence of the coding 
region of the wild type Corynebacterium dapA gene, which encodes lysine- 
insensitive DHDPS, described in Example 3. 

SEQ ID NO:7 was used in Example 4 to create an Nco I site at the 
translation start codon of the E. coli dapA gene. 

SEQ ID NOS:8, 9, 10 and 1 1 were used in Example 6 to create a chloroplast 
transit sequence and link the sequence to the E. coli IvsC . E. coli lysC -M4» E. coli 
dapA and Corynebacteria dapA genes. 

SEQ ID NOS:12 and 13 were used in Example 6 to create a Kpn I site 
immediately following the translation stop codon of the E. coli dapA gene. 
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SEQ ID NOS:14 and 15 were used in Example 6 as PCR primers to create a 
chloroplast transit sequence and link the sequence to the Corynebacterium dapA 
gene. 

SEQ ID NOS: 16-92 represent nucleic acid fragments and the polypeptides 
they encode that are used to create chimeric genes for lysine-rich synthetic seed 
storage proteins suitable for expression in the seeds of plants. 

SEQ ID NO:93 was used in Example 6 as a constitutive expression cassette 
for com. 

SEQ ID NOS:94-99 were used in Example 6 to create a com chloroplast 
transit sequence and link the sequence to the E . coli lysC -M4 gene. 

SEQ ID NOS: 100 and 101 were used in Example 6 as PCR primers to create 
a com chloroplast transit sequence and link the sequence to the E. coli dapA gene. 

SEQ ID NOS: 102 and 103 are cDNAs for plant lysine ketoglutarate 
ieductase/saccharopine dehydrogenase from Arabidopsis thaliana. 

SEQ ID NOS: 104 and 105 are polypeptides homologous to fungal 
saccharopine dehydrogenase (glutamate-forming) encoded by SEQ ID NOS: 102 
and 103, respectively. 

SEQ ID NOS: 106 and 107 were used in Example 25 as PCR primers to add 
Nco I and Kpn I sites at the 5 f and 3* ends of the com DHDPS gene. 

The Sequence Descriptions contain the one letter code for nucleotide 
sequence characters and the three letter codes for amino acids as defined in 
conformity with the IUPAC-IYUB standards described in Nucleic Acids Research 
13:3021-3030(1985) and in the Biochemical Journal 219 (No. 2):345-373(1984) 
which are incorporated by reference herein. 

DETAILED DESCRIPTION OF THE INVENTION 

The teachings below describe nucleic acid fragments and procedures useful 
for increasing the accumulation of lysine in the seeds of transformed plants, as 
compared to levels of lysine in untransfoimed plants. In order to increase the 
accumulation of free lysine in the seeds of plants via genetic engineering, a 
determination was made of which enyzmes in this pathway controlled the pathway 
in the seeds of plants. In order to accomplish this, genes encoding enzymes in the 
pathway were isolated from bacteria. In some cases, mutations in the genes were 
obtained so that the enzyme encoded was made insensitive to end-product 
inhibition. Intracellular localization sequences and suitable regulatory sequences 
for expression in the seeds of plants were linked to create chimeric genes. The 
chimeric genes were then introduced into plants via transformation and assessed 
for their ability to elicit accumulation of the lysine in seeds. 



Applicant has provided a unique first nucleic acid fragment comprised of two 
nucleic acid subfragments, one encoding AK which is substantially insensitive to 
inhibition by lysine and the other encoding DHDPS which is substantially 
insensitive to feedback inhibition by lysine. For the purposes of the present 
application the term substantially insensitive will mean at least 20-fold less sensitive 
to feedback inhibition by lysine than a typical plant enzyme catalyzing the same 
reaction. Applicant has found that a combination of subfragments successfully 
increases the lysine accumulated in seeds of transformed plants as compared to 
untransformed host plants. 

It has been discovered that the full potential for accumulation of excess free 
lysine in seeds is reduced by lysine catabolism. Furthermore, it has been 
discovered that lysine catabolism results in the accumulation of lysine breakdown 
products such as saccharopine and a-amino adipic acid. Provided herein are two 
alternative routes to reduce the loss of excess lysine due to catabolism and to 
reduce the accumulation of lysine breakdown products. In the first approach, 
lysine catabolism is prevented through reduction in the activity of the enzyme 
lysine ketoglutarate reductase (LKR), which catalyzes the first step in lysine 
breakdown. This can be accomplished by introducing a mutation that reduces or 
eliminates enzyme function in the plant gene that encodes LKR. Such mutations 
can be identified in lysine over-producer lines by screening mutants for a failure to 
accumulate the lysine breakdown products, saccharopine and a-amino adipic acid. 
Alternatively, several procedures to isolate plant LKR genes are provided; nucleic 
acid fragments containing plant LKR cDNAs are also provided. Chimeric genes 
for expression of antisense LKR RNA or for cosuppression of LKR in the seeds of 
plants can then be created. The chimeric LKR gene is linked to chimeric genes 
encoding lysine insensitive AK and DHDPS and all are introduced into plants via 
transformation simultaneously , or the chimeric genes are brought together by 
crossing plants transformed independently with each of the chimeric genes. 

In the second approach, excess free lysine is incorporated into a form that is 
insensitive to breakdown, e.g., by incorporating it into a di-, tri- or oligopeptide, or 
preferably a lysine-rich storage protein. The lysine-rich storage protein chosen 
should contain higher levels of lysine than average proteins. Ideally, these storage 
proteins should contain at least 15% lysine by weight. The design of a prefeixed 
class of polypeptides which can be expressed in vivo to serve as lysine-rich seed 
storage proteins is provided. Genes encoding the lysine-rich synthetic storage 
proteins (SSP) are synthesized and chimeric genes wherein the SSP genes are 
linked to suitable regulatory sequences for expression in the seeds of plants are 
created. The SSP chimeric gene is then linked to the chimeric DHDPS gene and 
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both are introduced into plants via transformation simultaneously, or the genes are 
brought together by crossing plants transformed independently with each of the 
chimeric gens. 

A method for transforming plants is taught herein wherein the resulting seeds 
of the plants have at least ten percent, preferably ten percent to four-fold greater, 
lysine than do the seeds of untransformed plants. Provided as examples herein are 
transformed rapeseed plants with seed lysine levels increased by 100% over 
untransformed plants and soybean plants with seed lysine levels increased by four- 
fold over lysine levels of untransformed plants. 

In the context of this disclosure, a number of terms shall be utilized. As used 
herein, the term "nucleic acid" refers to a large molecule which can be single- 
stranded or double-stranded, composed of monomers (nucleotides) containing a 
sugar, phosphate and either a purine or pyrimidine. A "nucleic acid fragment" is a 
fraction of a given nucleic acid molecule. In higher plants, deoxyribonucleic acid 
(DNA) is the genetic material while ribonucleic acid (RNA) is involved in the 
transfer of the information in DNA into proteins. A "genome" is the entire body of 
genetic material contained in each cell of an organism. The term "nucleotide 
sequence" refers to a polymer of DNA or RNA which can be single- or double- 
stranded, optionally containing synthetic, non-natural or altered nucleotide bases 
capable of incorporation into DNA or RNA polymers. 

As used herein, the term "homologous to" refers to the complementarity 
between the nucleotide sequence of two nucleic acid molecules or between the 
amino acid sequences of two protein molecules. Quantitative estimates of 
homology are provided by either DNA-DNA or DNA-RNA hybridization under 
conditions of stringency as is well understood by those skilled in the art [as 
described in Haines and Higgins (eds.) Nucleic Acid Hybridisation, IRL Press, 
Oxford, U.K.]; or by the comparison of sequence similarity between two nucleic 
acids or proteins. 

As used herein, "essentially similar" refers to DNA sequences that may 
involve base changes that do not cause a change in the encoded amino acid, or 
which involve base changes which may alter one or more amino acids, but do not 
affect the functional properties of the protein encoded by the DNA sequence. It is 
therefore understood that the invention encompasses more than the specific 
exemplary sequences. Modifications to the sequence, such as deletions, insertions, 
or substitutions in the sequence which produce silent changes that do not 
substantially affect the functional properties of the resulting protein molecule are 
also contemplated. For example, alteration in the gene sequence which reflect the 
degeneracy of the genetic code, or which result in the production of a chemically 
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equivalent amino acid at a given site, are contemplated; thus, a codon for the 
amino acid alanine, a hydrophobic amino acid, may be substituted by a codon 
encoding another less hydrophobic residue, such as glycine, or a more hydrophobic 
residue, such as valine, leucine, or isoleucine. Similarly, changes which result in 
substitution of one negatively charged residue for another, such as aspartic acid for 
glutamic acid, or one positively charged residue for another, such as lysine for 
arginine, can also be expected to produce a biologically equivalent product. 
Nucleotide changes which result in alteration of the N-terminal and C-terminal 
portions of the protein molecule would also not be expected to alter the activity of 
the protein. In some cases, it may in fact be desirable to make mutants of the 
sequence in order to study the effect of alteration on the biological activity of the 
protein. Each of the proposed modifications is well within the routine skill in the 
art, as is determination of retention of biological activity of the encoded products. 
Moreover, the skilled artisan recognizes that "essentially similar" sequences 
encompassed by this invention are also defined by their ability to hybridize, under 
stringent conditions (0.1X SSC, 0.1% SDS, 65°C), with the sequences exemplified 
herein. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, 
including regulatory sequences preceding (5 1 non-coding) and following (3' non- 
coding) the coding region. "Native" gene refers to the gene as found in nature with 
its own regulatory sequences. "Chimeric" gene refers to a gene comprising 
heterogeneous regulatory and coding sequences. "Endogenous" gene refers to the 
native gene normally found in its natural location in the genome. A "foreign" gene 
refers to a gene not normally found in the host organism but that is introduced by 
gene transfer. 

"Coding sequence" refers to a DNA sequence that codes for a specific 
protein and excludes the non-coding sequences. 

"Initiation codon" and "termination codon" refer to a unit of three adjacent 
nucleotides in a coding sequence that specifies initiation and chain termination, 
respectively, of protein synthesis (mRNA translation). "Open reading frame" refers 
to the amino acid sequence encoded between translation initiation and termination 
codons of a coding sequence. 

"RNA transcript" refers to the product resulting from RNA polymerase- 
catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred to as the primary 
transcript or it may be a RNA sequence derived from posttranscriptional 
processing of the primary transcript. "Messenger RNA (mRNA) refers to RNA 
that can be translated into protein by the cell. "cDNA" refers to a double-stranded 
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DNA that is complementary to and derived from mRNA. "Sense" RNA refers to 
RNA transcript that includes the mRNA. " Antisense RNA" refers to a RNA 
transcript that is complementary to all or part of a target primary transcript or 
mRNA and that blocks the expression of a target gene by interfering with the 
processing, transport and/or translation of its primary transcript or mRNA. The 
complementarity of an antisense RNA may be with any part of the specific gene 
transcript, i.e., at the 5' non-coding sequence, 3' non-coding sequence, introns, or 
the coding sequence. In addition, as used herein, antisense RNA may contain 
regions of ribozyme sequences that increase the efficacy of antisense RNA to block 
gene expression. "Ribozyme" refers to a catalytic RNA and includes sequence- 
specific endoribonucleases. 

As used herein, suitable "regulatory sequences" refer to nucleotide sequences 
located upstream (5 f ), within, and/or downstream (3') to a coding sequence, which 
control the transcription and/or expression of the coding sequences, potentially in 
conjunction with the protein biosynthetic apparatus of the cell. These regulatory 
sequences include promoters, translation leader sequences, transcription 
termination sequences, and polyadenylation sequences. 

"Promoter" refers to a DNA sequence in a gene, usually upstream (5 f ) to its 
coding sequence, which controls the expression of the coding sequence by 
providing the recognition for RNA polymerase and other factors required for 
proper transcription. A promoter may also contain DNA sequences that are 
involved in the binding of protein factors which control the effectiveness of 
transcription initiation in response to physiological or developmental conditions. It 
may also contain enhancer elements. 

An "enhancer" is a DNA sequence which can stimulate promoter activity. It 
may be an innate element of the promoter or a heterologous element inserted to 
enhance the level and/or tissue-specificity of a promoter. "Constitutive promoters" 
refers to those that direct gene expression in all tissues and at all times. "Organ- 
specific" or "development-specific" promoters as referred to herein are those that 
direct gene expression almost exclusively in specific organs, such as leaves or 
seeds, or at specific development stages in an organ, such as in early or late 
embryogenesis, respectively. 

The term "operably linked" refers to nucleic acid sequences on a single 
nucleic acid molecule which are associated so that the function of one is affected 
by the other. For example, a promoter is operably linked with a structure gene 
(i.e., a gene encoding aspartokinase that is lysine-insensitive as given herein) when 
it is capable of affecting the expression of that structural gene (i.e., that the 
structural gene is under the transcriptional control of the promoter). 
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The term "expression", as used herein, is intended to mean the production of 
the protein product encoded by a gene. More particularly, "expression" refers to 
the transcription and stable accumulation of the sense (mRNA) or tha antisense 
RNA derived from the nucleic acid fragment(s) of the invention that, in conduction 
5 with the protein apparatus of the cell, results in altered levels of protein product. 
"Antisense inhibition" refers to the production of antisense RNA transcripts 
capable of preventing the expression of the target protein. "Overexpression" refers 
to the production of a gene product in transgenic organisms that exceeds levels of 
production in normal or non-transformed organisms. "Cosuppression" refers to the 
expression of a foreign gene which has substantial homology to an endogenous 
gene resulting in the suppression of expression of both the foreign and the 
endogenous gene. "Altered levels" refers to the production of gene product(s) in 
transgenic organisms in amounts or proportions that differ from that of normal or 
non-transformed organisms. 

The "3' non-coding sequences" refers to the DNA sequence portion of a gene 
that contains a polyadenylation signal and any other regulatory signal capable of 
affecting mRNA processing or gene expression. The polyadenylation signal is 
usually characterized by affecting the addition of poly adenylic acid tracts to the 3 1 
end of the mRNA precursor. 

The "translation leader sequence" refers to that DNA sequence portion of a 
gene between the promoter and coding sequence that is transcribed into RNA and 
is present in the fully processed mRNA upstream (5') of the translation start codon. 
The translation leader sequence may affect processing of the primary transcript to 
mRNA, mRNA stability or translation efficiency. 

"Mature" protein refers to a post-translationally processed polypeptide 
without its targeting signal. "Precursor" protein refers to the primary product of 
translation of mRNA. A "chloroplast targeting signal" is an amino acid sequence 
which is translated in conjunction with a protein and directs it to the chloroplast. 
"Chloroplast transit sequence" refers to a nucleotide sequence that encodes a "H 



chloroplast targeting signal. 

"Transformation" herein refers to the transfer of a foreign gene into the 
genome of a host organism and its genetically stable inheritance. Examples of 
methods of plant transformation include Agrobacterium-mediaxed transformation 
and particle-accelerated or "gene gun" transformation technology. 

"Amino acids" herein refer to the naturally occuring L amino acids (Alanine, 
Arginine, Aspartic acid, Asparagine, Cystine, Glutamic acid, Glutamine, Glycine, 
Histidine, Isoleucine, Leucine, Lysine, Methionine, Proline, Phenylalanine, Serine, 
Threonine, Tryptophan, Tyrosine, and Valine). "Essential amino acids" are those 
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amino acids which cannot be synthesized by animals. A "polypeptide" or "protein" 
as used herein refers to a molecule composed of monomers (amino acids) linearly 
linked by amide bonds (also known as peptide bonds). 

"Synthetic protein" herein refers to a protein consisting of amino acid 
5 sequences that are not known to occur in nature. The amino acid sequence may be 
derived from a consensus of naturally occuring proteins or may be entirely novel. 

"Primary sequence" refers to the connectivity order of amino acids in a 
polypeptide chain without regard to the conformation of the molecule. Primary 
sequences are written from the amino terminus to the carboxy terminus of the 
10 polypeptide chain by convention. 

"Secondary structure" herein refers to physico-chemically favored regular 
backbone arrangements of a polypeptide chain without regard to variations in side 
chain identities or conformations. "Alpha helices" as used herein refer to right- 
handed helices with approximately 3.6 residues residues per turn of the helix. An 
15 "amphipathic helix" refers herein to a polypeptide in a helical conformation where 
one side of the helix is predominantly hydrophobic and the other side is 
predominantly hydrophilic. 

"Coiled-coil" herein refers to an aggregate of two parallel right-handed alpha 
helices which are wound around each other to form a left-handed superhelix. 

"Salt bridges" as discussed here refer to acid-base pairs of charged amino 
acid side chains so arranged in space that an attractive electrostatic interaction is 
maintained between two parts of a polypeptide chain or between one chain and 
another. 

"Host cell" means the cell that is transformed with the introduced genetic 
material. 

Isolation of AK Genes 
The E. coli lysC gene has been cloned, restriction endonuclease mapped and 
sequenced previously [Cassan et al. (1986) /. BioL Chem. 2(51:1052-1057]. For 
the present invention the lysC gene was obtained on a bacteriphage lambda clone 
from an ordered library of 3400 overlapping segments of cloned £. coli DNA 
constructed by Kohara, Akiyama and Isono [Kohara et al. (1987) Cell 
50:595-508]. The E. coli lysC gene encodes the enzyme AKUI, which is sensitive 
to lysine inhibition. Mutations were obtained in the lysC gene that cause the AKUI 
enzyme to be resistant to lysine. 

To determine the molecular basis for lysine-resistance,the sequence of the 
wild type lysC gene and three mutant genes were determined. The sequence of the 
cloned wild type IvsC gene, indicated in SEQ ID NO:l:, differed from the 
published lysC sequence in the coding region at 5 positions. 



17 



The sequences of the three mutant lvsC genes that encoded lysine-insensitive 
aspartokinase each differed from the wild type sequnce by a single nucleotide, 
resulting in a single amino acid substitution in the protein. One mutant (M2) had 
an A substituted for a G at nucleotide 954 of SEQ ID NO: 1 : resulting in an 
isoleucine for methionine substitution in the amino acid sequence of AKTQ and two 
mutants (M3 and M4) had identical T for C substitutions at nucleotide 1055 of 
SEQ ID NO:l resulting in an isoleucine for threonine substitution. 

Other mutations could be generated, either in vivo as described in Example 1 
or in vitro by site-directed mutagenesis by methods known to those skilled in the 
art, that result in amino acid substitutions for the methionine or threonine residue 
present in the wild type AKIII at these positions. Such mutations would be 
expected to result in a lysine-insensitive enzyme. Furthermore, the method 
described in Example 1 could be used to easily isolate and characterize as many 
additional mutant lysC genes encoding lysine insensitive AKIII as desired. 

A number of other AK genes have been isolated and sequenced. These 
include the thrA gene of E. coli (Katinka et al. (1980) Proc. Natl. Acad. Sci. USA 
77:5730-5733], the metL gene ofE. coli (Zakin et al. (1983) /. Biol. Chem. 
255:3028-3031], the HOM3 gene of 5. cerevisiae [Rafalski et al. (1988) /. Biol. 
Chem. 2(55:2146-2151]. The thrA gene of E. coli encodes a bifunctional protein, 
AKI-HDHI. The AK activity of this enzyme is insensitive to lysine, but sensitive to 
threonine. The metL gene of E. coli also encodes a bifunctional protein, 
AKH-HDHII, and the AK activity of this enzyme is also insensitive to lysine. The 
HOM3 gene of yeast encodes an AK which is insensitive to lysine, but sensitive to 
threonine. 

In addition to these genes, several plant genes encoding lysine-insensitive AK 
are known. In barley lysine plus threonine-resistant mutants bearing mutations in 
two unlinked genes that result in two different lysine-insensitive AK isoenzymes 
have been described [Bright et al. (1982) Nature 299:278-279, Rognes et al. 
(1983) Planta 757:32-38, Arruda et al. (1984) Plant Phsiol. 7(5:442-446]. In com, 
a lysine plus threonine-resistant cell line had AK activity that was less sensitive to 
lysine inhibition than its parent line [Hibberd et al. (1980) Planta 745:183-187]. A 
subsequently isolated lysine plus threonine-resistant com mutant is altered at a 
different genetic locus and also produces lysine-insensitive AK [Diedrick et al. 
(1990) Theor. Appl. Genet. 79/209-215, Dotson et al. (1990) Planta 
752:546-552]. In tobacco there are two AK enzymes in leaves, one lysine- 
sensitive and one threonine-sensitive. A lysine plus threonine-resistant tobacco 
mutant that expressed completely lysine-insensitive AK has been described 
[Frankard et al. (1991 ) Theor. Appl. Genet. 52:273-282]. These plant mutants 
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could serve as sources of genes encoding lysine -insensitive AK and used, based on 
the teachings herein, to increase the accumulation of lysine and threonine in the 
seeds of transformed plants. 

A partial amino acid sequence of AK from carrot has been reported [Wilson 
et al. (1991) Plant Physiol, 97:1323:1328]. Using this infoimation a set of 
degenerate DNA oligonucleotides could be designed, synthesized and used as 
hybridization probes to permit the isolation of the carrot AK gene. Recently the 
carrot AK gene has been isolated and its nucleotide sequence has been determined 
[Matthews et al. (1991) U.S.S.N. 07/746,705]. This gene can be used as a 
heterologous hybridization probe to isolate the genes encoding lysine-insensitive 
AK described above. 

High level expression of wild type and 
mutant lvsC genes in E. coli 
To achieve high level expression of the lysC genes in E. coli, a bacterial 
expression vector which employs the bacteriphage T7 RNA poIymerase/T7 
promoter system [Rosenberg et al. (1987) Gene 5(5:125-135] was used. The 
expression vector and lysC gene were modified as described in Example 2 to 
construct a lysC expression vector. For expression of the mutant lysC genes (M2, 
M3 and M4), the wild type lysC gene was replaced with the mutant genes as 
described in Example 2. 

For high level expression, each of the expression vectors was transformed 
into E. coli strain B121(DE3) [Studier et al. (1986)/. MoL Biol 759:113-130]. 
Cultures were grown, expression was induced, cells were collected, and extracts 
were prepared as described in Example 2. Supernatant and pellet fractions of 
extracts from uninduced and induces cultures were analyzed by SDS 
polyacrylamide gel electrophoresis and by AK enzyme assays as described in 
Example 2. The major protein visible by Coomassie blue staining in the 
supernatant and pellet fractions of induced cultures was AKHL About 80% of the 
AKDI protein was in the supernatant and AKHI represented 10-20% of the total 
E. coli protein in the extract. 

Approximately 80% of the AKHI enzyme activity was in the supernatant 
fraction. The specific activity of wild type and mutant crude extracts was 
5-7 pinoles product per minute per milligram total protein. Wild type AKHI was 
sensitive to the presence of L-lysine in the assay. Fifty percent inhibition was 
found at a concentration of about 0.4 mM and 90 percent inhibition at about 
0.1 mM. In contrast, mutants AKDI-M2, M3 and M4 were not inhibited at all by 
15 mM L-lysine. 
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Wild type AKJH protein was purified from the supernatant of an induced 
culture as described in Example 2. Rabbit antibodies were raised against the 
purified AKm protein. 

Many other microbial expression vectors have been described in the 
literature. One skilled in the art could make use of any of these to construct lvsC 
expression vectors. These lysC expression vectors could then be introduced into 
appropriate microorganisms via transformation to provide a system for high level 
expression of AKHL 

Isolation of DHDPS genes 
The E. coli dapA gene (eco dapA ) has been cloned, restriction endonuclease 
mapped and sequenced previously [Richaud et al. (1986) /. BacterioL 
7(5(5:297-300]. For the present invention the dapA gene was obtained on a 
bacteriophage lambda clone from an ordered library of 3400 overlapping segments 
of cloned E. coli. DNA constructed by Kohara, Akiyama and Isono [Kohara et al. 
(1987) Cell 50:595-508]. The eco dapA gene encodes a DHDPS enzyme that is 
sensitive to lysine inWbition. However, it is about 20-fold less sensitive to 
inhibition by lysine than a typical plant DHDPS, e.g., wheat germ DHDPS. 

The Corynebacterium dapA gene (co rdapA ) was isolated from genomic 
DNA from ATCC strain 13032 using polymerase chain reaction (PCR). The 
nucleotide sequence of the Corynebacterium dapA gene has been published 
[Bonnassie et al. (1990) Nucleic Acids Res. 18:6421]. From the sequence it was 
possible to design oligonucleotide primers for polymerase chain reaction (PCR) 
that would allow amplification of a DNA fragment containing the gene, and at the 
same time add unique restriction endonuclease sites at the start codon and just past 
the stop codon of the gene to facilitate further constructions involving the gene. 
The details of the isolation of the co rdapA gene are presented in Example 3. The 
co rdapA gene encodes a DHDPS enzyme that is insensitive to lysine inhibiton. 

In addition to introducing a restriction endonuclease site at the translation 
start codon, the PCR primers also changed the second codon of the co rdapA gene 
from AGC coding for serine to GCT coding for alanine. Several cloned DNA 
fragments that expressed active, lysine-insensitive DHDPS were isolated, indicating 
that the second codon amino acid substitution did not affect enzyme activity. 

The PCR-generated Corynebacterium dapA gene was subcloned into the 
phagemid vactor pGEM-9zf(-) from Promega, and single-stranded DNA was 
generated and sequenced (SEQ ID NO:6). Aside from the differenced in the 
second codon already mentioned, the sequence matched the published sequence 
except at two positions, nucleotides 798 and 799. In the published sequence these 
are TC, while in the gene shown in SEQ ID NO: 6 they are CT. This change results 
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in an amino acio^titution of leucine for senne. The re for this difference is 
not known. The difference has no apparent effect on DHDPS enzyme activity. 

The isolation of other genes encoding DHDPS has been described in the 
literature. A cDNA encoding DHDPS from wheat [Kaneko et al. (1990) /. Biol. 
Chem. 265: 17451-17455], and a cDNA encoding DHDPS from corn [Frisch et al. 
(1991) Mol. Gen. Genet. 225:287-293] are two examples. These genes encode 
wild type lysine-sensitive DHDPS enzymes. However, Negrutui et al. [(1984) 
Theor. Appl. Genet. 68: 1 1-20], obtained two AEC-resistant tobacco mutants in 
which DHDPS activity was less sensitive to lysine inhibition than the wild type 
enzyme. These genes could be isolated using the methods already described for 
isolating the wheat or com genes or, alternatively, by using the wheat or corn 
genes as heterologous hybridization probes. 

Still other genes encoding DHDPS could be isolated by one skilled in the art 
by using either the ecodapA gene, me cordapA gene, or either of the plant DHDPS 
genes as DNA hybridization probes. Alternatively, other genes encoding DHDPS 
could be isolated by functional complementation of an E. coli djpA mutant, as was 
done to isolate the cordapA gene [Yeh et al. (1988) Mol. Gen. Genet. 
212: 105-1 11] and the corn DHDPS gene. 

High level expression pi ecodapA and 
cordap A penes in E. coli 
To achieve high level expression of the ecodapA and cordapA genes in 
E coli, a bacterial expression vector which employs the bacteriophage T7 RNA 
polymerase/^ promoter system [Rosenberg et al. (1987) Gene 56:127-135] was 
used. The vector and dapA genes were modified as described below to construct 
ecodapA and cordapA expression vectors. 

For high level expression each of the expression vectors was transformed 
into E. coli strain BL21(DE3) [Studier et al. (1986)/. Mol. Biol. 759:113-130]. 
Cultures were grown, expression was induced, cells were collected, and extracts 
were prepared as described in Example 4. Supernatant and pellet fractions of 
extracts from uninduced and induced cultures were analyzed by SDS 
polyacrylamide gel electrophoresis and by DHDPS enzyme assays as described in 
Example 4. The major protein visible by Coomassie blue staining in the 
supernatant and pellet fractions of both induced cultures had a molecular weight of 
32-34 kd, the expected size for DHDPS. Even in the uninduced cultures this 
protein was the most prominent protein produced. 

In the induced culture with the ecodapA gene about 80% of the DHDPS 
protein was in the supernatant and DHDPS represented 10-20% of the total 
protein in the extract. In the induced culture with the cordapA gene more than 
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50% of the DHE^^rotein was in the pellet fraction. ThS^Rlet fractions in both 
cases were 90-95% pure DHDPS, with no other single protein present in 
significant amounts. Thus, these fractions were pure enough for use in the 
generation of rabbit antibodies. 
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The specific activity of E, coli DHDPS in the supernatant fraction of induced 



extracts was about 50 OD540 units per milligram protein. E. coli DHDPS was 
sensitive to the presence of L-lysine in the assay. Fifty percent inhibition was 
found at a concentration of about 0.5 mM. For Corynebacteriwn DHDPS, 
enzyme activity was measured in the supernatant fraction of uninduced extracts, 

10 rather than induced extracts. Enzyme activity was about 4 OD530 units per minute 
per milligram protein. In contrast to E. coli DHDPS, Corynebacteriwn DHDPS 
was not inhibited at all by L-lysine, even at a concentration of 70 mM. 

Many other microbial expression vectors have been described in the 
literature. One skilled in the art could make use of any of these to construct 

15 ec odapA or co rdapA expression vectors. These expression vectors could then be 
introduced into appropriate microorganisms via transformation to provide a system 
for high level expression of DHDPS. 



The E. coli expression cassettes were inserted into expression vectors and 
then transformed into E. coli strain BL21(DE3) [Studier et al. (1986) /. Mol BioL 
259:113-130] to induce E. coli to produce and excrete amino acids. Details of the 
procedures used and results are presented in Example 5. 

Other microbial expression vectors known to those skilled in the art could be 
used to make and combine expression cassettes for the lysC and dapA genes. 
These expression vectors could then be introduced into appropriate 
microorganisms via transformation to provide alternative systems for production 
and excretion of lysine, theronine and methionine. 



A preferred class of heterologous hosts for the expression of the chimeric 
genes of this invention are eukaryotic hosts, particularly the cells of higher plants. 
Preferred among the higher plants and the seeds derived from them are soybean, 
rapeseed (Brassica napus, B. campestris), sunflower (Helianthus annus), cotton 
(Gossypium hirsutum), com, tobacco (Nicotiana Tubacum), alfalfa {Medicago 
sativa), wheat (Triticum sp), barley {Hordeum vulgare), oats (Avena sativa, L), 
sorghum (Sorghum bicolor), rice (Oryza sativa), and forage grasses. Expression in 
plants will use regulatory sequences functional in such plants. The expression of 
foreign genes in plants is well-established [De Blaere et al. (1987) Meth. EnzymoL 



Excretion of amino acids by E. coli expressing 
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143:211-291]. ^^^ r * eve l of expression of the different eric genes of this 
invention in plant cells may be achieved through the use of many different 
promoters. Such chimeric genes can be transferred into host plants either together 
in a single expression vector or sequentially using more than one vector. 

The origin of promoter chosen to drive the expression of the coding 
sequence is not critical as long as it has sufficient transcriptional activity to 
accomplish the invention by expressing translatable mRNA or antisense RNA in the 
desired host tissue. Preferred promoters for expression in all plant organs, and 
especially for expression in leaves include those directing the 19S and 35S 
transcripts in Cauliflower mosaic virus [Odell et aL(1985) Nature 373:810-812; 
Hull et al. (1987) Virology 5(5:482-493], small subunit of ribulose 1,5-bisphosphate 
carboxylase [Morelli et al.(1985) Nature 375:200; Broglie et al. (1984) Science 
224:838; Hererra-Estrella et al.( 1 984) Nature 310:115; Coruzzi et al.(1984) 
EMBO J. 3:1671; Faciotti et al.(1985) Bio/Technology 3:241], maize zein protein 
[Matzke et al.(1984) EMBO J. 3:1525], and chlorophyll a/b binding protein 
[Lampa et al.(1986) Nature 37(5:750-752]. 

Depending upon the application, it may be desirable to select promoters that 
are specific for expression in one or more organs of the plant. Examples include 
the light-inducible promoters of the small subunit of ribulose 1 ,5-bisphosphate 
carboxylase, if the expression is desired in photosynthetic organs, or promoters 
active specifically in seeds. 

Preferred promoters are those that allow expression specifically in seeds. 
This may be especially useful, since seeds are the primary source of vegetable 
amino acids and also since seed-specific expression will avoid any potential 
deleterious effect in non-seed organs. Examples of seed-specific promoters 
include, but are not limited to, the promoters of seed storage proteins. The seed 
storage proteins are stricdy regulated, being expressed almost exclusively in seeds 
in a highly organ-specific and stage-specific manner [Higgins et aL(1984) Ann. 
Rev. Plant Physiol. 35:191-221; Goldberg et al.(1989) Cell 56: 149-160; 
Thompson et al. (1989) BioEssays 7(7:108-1 13]. Moreover, different seed storage 
proteins may be expressed at different stages of seed development. 

There are currently numerous examples for seed-specific expression of seed 
storage protein genes in transgenic dicotyledonous plants. These include genes 
from dicotyledonous plants for bean P-phaseolin [Sengupta-Goplalan et al. (1985) 
Proc. Natl. Acad. Sci. USA 52:3320-3324; Hoffinan et al. (1988) Plant Mol. Biol. 
77:717-729], bean lectin [VoeUcer et al. (1987) EMBO J. 6: 3571-3577], soybean 
lectin [Okamuro et al. (1986) Proc. Natl. Acad. Sci. USA 53:8240-8244], soybean 
kunitz trypsin inhibitor [Perez-Grau et al. (1989) Plant Cell 7:095-1109], soybean 
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p-conglycinin [B^Iy et al. (1985) EMBO J. 4:3047-305^trker et al. (1988) 
Proc. Natl. Acad. Sci. USA 55:458-462; Chen et al. (1988) EMBO J. 7:297-302; 
Chen et al. (1989) Dev. Genet. 10: 1 12-122; Naito et al. (1988) Plant Mol. Biol. 
77:109-123], pea victim [Higgins et al. (1988) Plant Mol. Biol. 77:683-695], pea 
convicilin [Newbigin et al. (1990) Planta 750:461], pea legumin [Shirsat et al. 
(1989) MoL Gen. Genetics 275:326]; rapeseed napin [Radke et al. (1988) Theor. 
Appl. Genet. 75:685-694] as well as genes from monocotyledonous plants such as 
for maize 15 kD zein [Hoffinan et al. (1987) EMBO /. (5:3213-3221; Schernthaner 
et al. (1988) EMBO J. 7:1249-1253; Williamson et al. (1988) Plant PhysioL 
55:1002-1007], barley p-hordein [Marris et al. (1988) Plant Mol. Biol. 
70:359-366] and wheat glutenin [Colot et al. (1987) EMBO J. (5:3559-3564]. 
Moreover, promoters of seed-specific genes, operably linked to heterologous 
coding sequences in chimeric gene constructs, also maintain their temporal and 
spatial expression pattern in transgenic plants. Such examples include Arabidopsis 
thaliana 2S seed storage protein gene promoter to express enkephalin peptides in 
Arabidopsis and B. napus seeds [Vandekerckhove et al. (1989) Bio/Technology 
7:929-932], bean lectin and bean P-phaseolin promoters to express luciferase 
[Riggs et al. (1989) Plant Sci. (55:47-57], and wheat glutenin promoters to express 
chloramphenicol acetyl transferase [Colot et al. (1987) EMBO J. 6:3559-3564]. 

Of particular use in the expression of the nucleic acid fragment of the 
invention will be the heterologous promoters from several extensively- 
characterized soybean seed storage protein genes such as those for the Kunitz 
trypsin inhibitor [Jofuku et al. (1989) Plant Cell 7:1079-1093; Perez-Grau et al. 
(1989) Plant Cell 7: 1095-1 109], glycinin [Nielson et al. (1989) Plant Cell 
7:313-328], p-conglycinin [Harada et al. (1989) Plant Cell 7:415-425]. Promoters 
of genes for a ? - and p-subunits of soybean P-conglycinin storage protein will be 
particularly useful in expressing mRNAs or antisense RNAs in the cotyledons at 
mid- to late-stages of soybean seed development [Beachy et al. (1985) EMBO J. 
4:3047-3053; Barker et al. (1988) Proc. Natl. Acad. Sci. USA 55:458-462; Chen 
et al. (1988) EMBO J. 7:297-302; Chen et al. (1989) Dev. Genet. 70:112-122; 
Naito et al. (1988) Plant Mol. Biol. 77:109-123] in transgenic plants, since: 

a) there is very little position effect on their expression in transgenic seeds, and 

b) the two promoters show different temporal regulation: the promoter for the 
a'-subunit gene is expressed a few days before that for the P-subunit gene. 

Also of particular use in the expression of the nucleic acid fragments of the 
invention will be the heterologous promoters from several extensively 
characterized com seed storage protein genes such as endosperm-specific 
promoters from the 10 kD zein [Kirihara et al. (1988) Gene 77:359-370], the 
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27 kD zein [Pra^pLl. (1987) Gene 52:51-49; Gallardo eJ^[1988) Plant Sci. 
54:21 1-28 1], and the 19 kD zein [Marks et al. (1985) /. Biol. Chem. 
260: 16451-16459]. The relative transcriptional activities of these promoters in 
com have been reported [Kodrzyck et al. (1989) Plant Cell 1: 105-1 14] providing 
5 basis for choosing a promoter for use in chimeric gene constructs for com. For 
expression in com embryos, the strong embryo-specific promoter from the GLB1 
gene [Kriz (1989) Biochemical Genetics 27:239-251, Wallace et al. (1991) Plant 
PhysioL 95:973-975] can be used. 

It is envisioned that the introduction of enhancers or enhancer-like elements 
10 into other promoter constructs will also provide increased levels of primary 

transcription to accomplish the invention. These would include viral enhancers 
such as that found in the 35S promoter [Odell et al. (1988) Plant Mol. BioL 
70:263-272], enhancers from the opine genes [Fromm et al. (1989) Plant Cell 
7:977-984], or enhancers from any other source that result in increased 
15 transcription when placed into a promoter operably linked to the nucleic acid 
fragment of the invention. 

Of particular importance is the DNA sequence element isolated from the 
gene for the a -subunit of (3-conglycinin that can confer 40-fold seed-specific 
enhancement to a constitutive promoter [Chen et al. (1988) EMBO /. 7:297-302; 
Chen et al. (1989) Dev. Genet. 10:1 12-122]. One skilled in the art can readily 
isolate this element and insert it within the promoter region of any gene in order to 
obtain seed-specific enhanced expression with the promoter in transgenic plants. 
Insertion of such an element in any seed-specific gene that is expressed at different 
times than the J3-conglycinin gene will result in expression in transgenic plants for a 
longer period during seed development. 

Any 3' non-coding region capable of providing a polyadenylation signal and 
other regulatory sequences that may be required for the proper expression can be 
used to accomplish the invention. This would include the 3' end from any storage 
protein such as the 3' end of the bean phaseolin gene, the 3' end of the soybean 
p-conglycinin gene, the 3' end from viral genes such as the 3' end of the 35S or the 
19S cauliflower mosaic virus transcripts, the 3' end from the opine synthesis genes, 
the 3' ends of ribulose 1,5-bisphosphate carboxylase or chlorophyll a/b binding 
protein, or 3* end sequences from any source such that the sequence employed 
provides the necessary regulatory information within its nucleic acid sequence to 
result in the proper expression of the promoter/coding region combination to 
which it is operably linked. There are numerous examples in the art that teach the 
usefulness of different 3' non-coding regions [for example, see Ingelbrecht et al. 
(1989) Plant Cell i:671-680]. 
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to the lysC and dapA coding sequence if required for the proper expression of the 
proteins to accomplish the invention. Plant amino acid biosynthetic enzymes are 
known to be localized in the chloroplasts and therefore are synthesized with a 
5 chloroplast targeting signal. Bacterial proteins such as DHDPS and AKIH have no 
such signal. A chloroplast transit sequence could, therefore, be fused to the dapA 
and lysC coding sequences. Preferred chloroplast transit sequences are those of 
the small subunit of ribulose 1,5-bisphosphate carboxylase, e.g. from soybean 
[Berry-Lowe et al. (1982) /. Mol. Appl. Genet. 7:483^98] for use in 
10 dicotyledonous plants and from com [Lebrun et al. (1987) Nucleic Acids Res. 
75:4360] for use in monocotyledonous plants. 

Introduction of Chimeric Genes into Plants 
Various methods of introducing a DNA sequence (Le., of transforming) into 
eukaryotic cells of higher plants are available to those skilled in the art (see EPO 
15 publications 0 295 959 A2 and 0 138 341 Al). Such methods include those based 
on transformation vectors based on the Ti and Ri plasmids of Agrobacterium spp. 
It is particularly preferred to use the binary type of these vectors. Ti-derived 
vectors transform a wide variety of higher plants, including monocotyledonous and 
dicotyledonous plants, such as soybean, cotton and rape [Pacciotti et al. (1985) 
Bio/Technology 5:241; Byrne et al. (1987) Plant Cell, Tissue and Organ Culture 
8:3; Sukhapinda et al. (1987) Plant Mol. Biol. 5:209-216; Lorz et al. (1985) MoL 
Gen. Genet. 799:178; Potrykus (1985) MoL Gen. Genet. 799:183]. 

For introduction into plants the chimeric genes of the invention can be 
inserted into binary vectors as described in Examples 7-12 and 14-16. The vectors 
are part of a binary Ti plasmid vector system [Bevan, (1984) Nucl. Acids. Res. 
72:8711-8720] of Agrobacterium tumefaciens. 

Other transformation methods are available to those skilled in the art, such as 
direct uptake of foreign DNA constructs [see EPO publication 0 295 959 A2], 
techniques of electroporation [see Fromm et al. (1986) Nature (London) 579:791] 
or high- velocity ballistic bombardment with metal particles coated with the nucleic 
acid constructs [see Kline et al. (1987) Nature (London) 527:70, and see U.S. Pat. 
No. 4,945,050]. Once transformed, the cells can be regenerated by those skilled in 
the art. 

Of particular relevance are the recently described methods to transform 
foreign genes into commercially important crops, such as rapeseed [see De Block 
et al. (1989) Plant Physiol. 97:694-701], sunflower [Everett et al. (1987) 
Bio/Technology 5:1201], soybean [McCabe et al. (1988) Bio/Technology 6:923; 
Hinchee et al. (1988) Bio/Technology (5:915; Chee et al. (1989) Plant Physiol 
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97:1212-1218; ft>tou et al. (1989) Proc. Natl. Acad. S^SA 86:7500-7504; 
EPO Publication 0 301 749 A2], and com [Gordon-Kamm et al. (1990) Plant Cell 
2:603-618; Fromm et al. (1990) Biotechnology 8:833-839]. 

For introduction into plants by high-velocity ballistic bombardment, the 
chimeric genes of the invention can be inserted into suitable vectors as described in 
Example 6. Transformed plants can be obtained as described in Examples 17-19. 
Re pression of lvsC and dapA Chimeric Genes 
in Tobacco Plants 

To assay for expression of the chimeric genes in leaves or seeds of the 
transformed plants, the AKffl or DHDPS proteins can be detected and quantitated 
enzymatically and/or immunologically by methods known to those skilled in the art. 
In this way lines producing high levels of expressed protein can be easily identified 

In order to measure the free amino acid composition of the leaves, free 
amino acids can be extracted by various methods including those as described in 
Example 7. To measure the free or total amino acid composition of seeds, extracts 
can be prepared by various methods including those as described in Example 8. 

There was no significant effect of expression of AKffl or AKHTM4 (with a 
chloroplast targeting signal) on the free lysine or threonine (or any other amino 
acid) levels in the leaves (see Table 2 in Example 7). Since AKffl-M4 is insensitive 
to feedback inhibition by any of the end-products of the pathway, this indicates that 
control must be exerted at other steps in the biosynthetic pathway in leaves. 

In contrast, expression of the AKffl or AKHI-M4 (with a chloroplast 
targeting signal) in the seeds resulted in 2 to 4-fold or 4 to 23-fold increases, 
respectively, in the level of free threonine in the seeds compared to untransformed 
plants and 2 to 3-fold increases in the level of free lysine in some cases (Table 3, 
Example 8). There was a good correlation between transform ants expressing 
higher levels of AKffl or AKIII-M4 protein and those having higher levels of free 
threonine, but this was not the case for lysine. The relatively small increases of free 
threonine or lysine achieved with the AKffl protein were not sufficient to yield 
detectable increases compared to untransformed plants, in the levels of total 
threonine or lysine in the seeds. The larger increases of free threonine achieved via 
expression of the AKHI-M4 protein were sufficient to yield detectable increases, 
compared to seeds from untransformed plants, in the levels of total threonine in the 
seeds. Sixteen to twenty-five percent increases in total threonine content of the 
seeds were observed. The lines that showed increased total threonine were the 
same ones the showed the highest levels of increase in free threonine and high 
expression of the AKIII-M4 protein. 
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The above ^rnings show that amino acid biosynth^Wakes place in seeds 
and can be modulated by the expression of foreign genes encoding amino acid 
biosynthetic enzymes. Furthermore, they show that control of an amino acid 
biosynthetic pathway can differ markedly from one plant organ to another, e.g. 
seeds and leaves. The importance of this observation is emphasized upon 
considering the different effects of expressing a foreign DHDPS in leaves and seeds 
described below. It can be concluded that threonine biosynthesis in seeds is 
controlled primarily via end-product inhibition of AK. Therefore, threonine 
accumulation in the seeds of plants can be increased by expression of a gene, 
introduced via transformation, that encodes AK which is insensitive to lysine 
inhibition and which is localized in the chloroplast. 

The above teachings also demonstrate that transformed plants which express 
higher levels of the introduced enzyme in seeds accumulate higher levels of free 
threonine in seeds. Furthermore, the teachings demonstrate that transformed 
plants which express a lysine-insensitive AK in seeds accumulate higher levels of 
free threonine in seeds than do transformed plants which express similar levels of a 
lysine-sensitive AK. To achieve commercially valuable increases in free threonine, 
a lysine-insensitive AK is preferred. 

These teachings indicate that the level of free lysine in seeds controls the 
accumulation of another aspartate-derived amino acid, threonine, through end- 
product inhibition of AK. In order to accumulate high levels of free lysine itself, it 
will be necessary to bypass lysine inhibition of AK via expression of a lysine- 
insensitive AK. 

Expression of active E. coli DHDPS enzyme was achieved in both young and 
mature leaves of the transformed tobacco plants (Table 4, Example 9). High levels 
of free lysine, 50 to 100-fold higher than normal tobacco plants, accumulated in the 
young leaves of the plants expressing the enzyme with a chloroplast targeting 
signal, but not without such a targeting signal. However, a much smaller 
accumulation of free lysine (2 to 8-fold) was seen in the larger leaves. Experiments 
that measure lysine in the phloem suggest that lysine is exported from the large 
leaves. This exported lysine may contribute to the accumulation of lysine in the 
small growing leaves, which are known to take up, rather than export nutrients. 
No effect on the free lysine levels in the seeds of these plants was observed even 
though E. coli DHDPS enzyme was expressed in the seeds as well as the leaves. 

High level seed-specific expression of E, coli DHDPS enzyme, either with or 
without a chloroplast targeting signal, had no effect on the total, or free, lysine or 
threonine (or any other amino acid) composition of the seeds in any transformed 
line (Table 5, Example 10). These results demonstrate that expression in seeds of a 
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DHDPS enzyme is substantially insensitive to lysine i^^tion is not sufficient 
to lead to increased production or accumulation of free lysine. 

These teachings from transformants expressing the E. coli DHDPS enzyme 
indicate that lysine biosynthesis in leaves is controlled primarily via end-product 
inhibition of DHDPS, while in seeds there must be at least one additional point of 
control in the pathway. The teachings from transformants expressing the E. coli 
A Km and AKHT-M4 enzymes indicate that the level of free lysine in seeds controls 
the accumulation of all aspartate-derived amino acids through end-product 
inhibition of AK. AK is therefore an additional control point. 

To achieve simultaneous, high level expression of both E. coli DHDPS and 
AKIQ-M4 in leaves and seeds, plants that express each of the genes could be 
crossed and hybrids that express both could be selected. Another method would 
be to construct vectors that contain both genes on the same DNA fragment and 
introduce the linked genes into plants via transformation. This is preferred because 
the genes would remain linked thoughout subsequent plant breeding efforts. 
Representative vectors carrying both genes on the same DNA fragment are 
described in Examples 1 1, 12, 15, 16, 18, 19, and 25. 

Tobacco plants transformed with a vector carrying both E. coli DHDPS and 
AKHI-M4 genes linked to the 35S promoter are described in Example 1 1. In 
transformants that express little or no AKHI-M4, the level of expression of E. coli 
DHDPS determines the level of lysine accumulation in leaves (Example 11, 
Table 6). However, in transformants that express both AKIII-M4 and E. coli 
DHDPS, the ievel of expression of each protein plays a role in controlling the level 
of lysine accumulation. Transformed lines that express DHDPS at comparable 
levels accumulate more lysine when AKIH-M4 is also expressed (Table 6, compare 
lines 564-18A, 564-56A, 564-36E, 564-55B, and 564-47A). Thus, expression of a 
lysine-insensitive AK increases lysine accumulation in leaves when expressed in 
concert with a DHDPS enzyme that is 20-fold less sensitive to lysine than the 
endogenous plant enzyme. 

These leaf results, taken together with the seed results derived from 
expressing E. coli AKEH-M4 and E. coli DHDPS separately in seeds, suggest that 
simultaneous expression of both E. coli AKIII-M4 and E. coli DHDPS in seeds 
would lead to increased accumulation of free lysine and would also lead to an 
increased accumulation of free threonine. Tobacco plants transformed with a 
vector carrying both E. coli DHDPS and AKHI-M4 genes linked to the phaseolin 
promoter are described in Example 12. There is an increased accumulation of free 
lysine and free threonine in these plants. The increased level of free threonine was 
4-fold over normal seeds, rather than the 20-fold increase seen in seeds expressing 
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AKIH-M4 alone^Kie reduction In accumulation of free tSBcnine indicates that 
pathway intermediates are being diverted down the lysine branch of the 
biosynthetic pathway. The increased level of free lysine was 2-fold over normal 
seeds (or seeds expressing E. coli DHDPS alone). However, the lysine increase in 
seeds is not equivalent to the 100-fold increase seen in leaves. 

The E. coli DHDPS enzyme is less sensitive to lysine inhibition than plant 
DHDPS, but is still inhibited by lysine. The above teachings on the AK proteins 
indicate that expression of a completely lysine-insensitive enzyme can lead to a 
much greater accumulation of the aspartate pathway end-product threonine than 
expression of an enzyme which, while less sensitive than the plant enzyme, is still 
inhibited by lysine. Therefore vectors canying both Corynebacterium JDHDPS and 
AKHI-M4 genes linked to the seed-specific promoters were constructed as 
described in Examples 15 and 19. Tobacco plants transformed with vectors 
carrying both Corynebacterium DHDPS and AKUI-M4 genes linked to seed- 
specific promoters are described in Example 15. As shown in Table 9, these plants 
did not show a greater accumulation of free lysine in seeds than previously 
described plants expressing the E. coli DHDPS enzyme in concert with the lysine- 
insensitive AK. In hindsight this result can be explained by the fact that lysine 
accumulation in seeds never reached a level high enough to inhibit the E. coli 
DHDPS, so replacement of this enzyme with lysine-insensitive Corynebacterium 
DHDPS had no effect. 

In transformed lines expressing high levels of E. coli AKDI-M4 and E. coli 
DHDPS or Corynebacterium DHDPS . it was possible to detect substantial 
amounts of oc-aminoadipic acid in seeds. This compound is thought to be an 
intermediate in the catabolism of lysine in cereal seeds, but is normally detected 
only via radioactive tracer experiments due to its low level of accumulation. The 
discovery of high levels of this intermediate, comparable to levels of free amino 
acids, indicates that a large amount of lysine is being produced in the seeds of these 
transformed lines and is entering the catabolic pathway. The build-up of 
oc-aminoadipic acid was not observed in transform ants expressing only E. coli 
DHDPS or only AKIQ-M4 in seeds. These results show that it is necessary to 
express both enzymes simultaneously to produce high levels of free lysine in seeds. 
To accumulate high levels of free lysine it may also be necessary to prevent lysine 
catabolism. Alternatively, it may be desirable to convert the high levels of lysine 
produced into a form that is insensitive to breakdown, e.g. by incorporating it into 
a di-, tri- or oligopeptide, or a lysine-rich storage protein. 
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in Rapeseed and Soybean Plants 
To analyze for expression of the chimeric lysC and dapA genes in seeds of 
transformed rapeseed and soybean and to determine the consequences of 
expression on the amino acid content in the seeds, a seed meal can be prepared as 
described in Examples 16 or 19 or by any other suitable method. The seed meal 
can be partially or completely defatted, via hexane extraction for example, if 
desired. Protein extracts can be prepared from the meal and analyzed for AK 
and/or DHDPS enzyme activity. Alternatively the presence of the AK and/or 
DHDPS protein can be tested for immunologically by methods well-known to 
those skilled in the art. To measure free amino acid composition of the seeds, free 
amino acids can be extracted from the meal and analyzed by methods known to 
those skilled in the art (see Examples 8,16 and 19 for suitable procedures). 

All of the rapeseed transform ants obtained from a vector carrying the 
co rdapA gene expressed the Corynebacterium DHDPS protein, and six of eight 
transform ants obtained from a vector carrying the lysC -M4 gene expressed the 
AKHI-M4 protein (Example 16, Table 12). Thus it is straightforward to express 
these proteins in oilseed rape seeds. Transform ants expressing DHDPS protein ^ 
showed a greater than 100-fold increase in free lysine level in their seeds. There 
was a good correlation between transformants expressing higher levels of DHDPS 
protein and those having higher levels of free lysine. One transformant that 
expressed AKQI-M4 in the absence of Corynebacteria DHDPS showed a 5-fold 
increase in the level of free threonine in the seeds. Concomitant expression of both 
enzymes resulted in accumulation of high levels of free lysine, but not threonine. 

A high level of a-aminoadipic acid, indicative of lysine catabolism, was 
observed in many of the transformed lines, especially lines expressing the highest 
levels of DHDPS and AKIII protein. Thus, prevention of lysine catabolism by 
inactivation of lysine ketoglutarate reductase should further increase the 
accumulation of free lysine in the seeds. Alternatively, incorporation of lysine into 
a peptide or lysine-rich protein would prevent catabolism and lead to an increase in 
the accumulation of lysine in the seeds. 

To measure the total amino acid composition of mature rapeseed seeds, 
defatted meal was analyzed as described in Example 16. Relative amino acid levels 
in the seeds were compared as percentages of lysine to total amino acids. Seeds 
with a 5-100% increase in the lysine level, compared to the untransformed control, 
were observed. The transformant with the highest lysine content expressed high 
levels of both E. coli AKIII-M4 and Corynebacterium DHDPS. In this 
transformant lysine makes up about 13% of the total seed amino acids, 
considerably higher than any previously known rapeseed seed. 
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Six of sevewWybean transform ants expressed the DH^S protein. In the six 
transform ants that expressed DHDPS, there was excellent correlation between 
expression of GUS and DHDPS in individual seeds. Therefore, the GUS and 
DHDPS genes are integrated at the same site in the soybean genome. Four of 
seven transform ants expressed the AKJH protein, and again there was excellent 
correlation between expression of AKHI, GUS and DHDPS in individual seeds. 
Thus, in these four transform ants the GUS, AKIH and DHDPS genes are 
integrated at the same site in the soybean genome. 

Soybean transfoimants expressing Corynebacteria DHDPS alone and in 
concert with E. coli AKJH-M4 accumulated high levels of free lysine in their seeds. 
A high level of saccharopine, the first metabolic product of lysine catabolism, was 
also observed in seeds that contained high levels of lysine. Lesser amounts of a- 
amino adipic acid were also observed. Thus, prevention of lysine catabolism by 
inactivation of lysine ketoglutarate reductase should further increase the 
accumulation of free lysine in the soybean seeds. Alternatively, incorporation of 
lysine into a peptide or Iysine-rich protein would prevent catabolism and lead to an 
increase in the accumulation of lysine in the soybean seeds. 

Analyses of free lysine levels in individual seeds from transform ants in which 
the transgenes segregated as a single locus revealed that the increase in free lysine 
level was significantly higher in about one-fourth of the seeds. Since one-fourth of 
the seeds are expected to be homozygous for the transgene, it is likely that the 
higher lysine seeds are the homozygotes. Furthermore, this indicates that the level 
of increase in free lysine is dependent upon the transgene copy number. Therefore, 
lysine levels could be further increased by making hybrids of two different 
transform ants, and obtaining progeny that are homozygous at both transgene loci. 

The soybean seeds expressing Corynebacteria DHDPS showed substantial 
increases in accumulation of total seed lysine. Seeds with a 5-35% increase in total 
lysine content, compared to the untransformed control, were observed. In these 
seeds lysine makes up 7.5-7.7% of the total seed amino acids. 

Soybean seeds expressing Corynebacteria DHDPS in concert with E. coli 
AKDI-M4 showed much greater accumulation of total seed lysine than those 
expressing Corynebacteria DHDPS alone. Seeds with a more than four-fold 
increase in total lysine content were observed. In these seeds lysine makes up 
20-25% of the total seed amino acids, considerably higher than any previously 
known soybean seed. 
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session of lvsC and dapA Chimeric 
in Com Plants 




Com plants regenerated from transformed callus can be analyzed for the 
presence of the intact lvsC and dapA transgenes via Southern blot or PCR. Plants 
carrying the genes are either selfed or outcrossed to an elite line to generate Fl 
seeds. Six to eight seeds are pooled and assayed for expression of the 
Corynebacterium DHDPS protein and the E. coli AKIH-M4 protein by western 
blot analysis. The free amino acid composition and total amino acid composition 
of the seeds are determined as described above. 

Expression of the Corynebacterium DHDPS protein, and/or the E. coli 
AKIH-M4 protein can be obtained in the embryo of the seed using regulatory 
sequences active in the embryo, preferably derived from the globulin 1 gene, or in 
the endosperm using regulatory sequences active in the endosperm, preferably 
derived from the glutelin 2 gene or the lOkD zein gene (see Example 26 for 
details). Free lysine levels in the seeds is increased from about 1.4% of free amino 
acids in control seeds to 15-27% in seeds of transform ants expressing 
Corynebacterium DHDPS alone from the globulin 1 promoter. The increased free 
lysine was localized to the embryo in seeds expressing Corynebacterium DHDPS 
from the globulin 1 promoters 




The large increases in free lysine result in significant increases in the total 
seed lysine content. Total lysine levels can be increased at least 130% in seeds 
expressing Corynebacterium DHDPS from the globulin 1 promoter. Greater 
increases in free lysine levels can be achieved by expressing E. coli AKHI-M4 
protein from the globulin 1 promoter in concert with Corynebacterium DHDPS. 

Lysine catabolism is expected to be much greater in the com endosperm than 
the embryo. Thus, to achieve significant lysine increases in the endosperm it is 
preferable to express both Corynebacterium DHDPS and the E. coli AKUI-M4 in 
the endosperm and to reduce lysine catabolism by reducing the level of lysine 
ketoglutarate reductase as described below. 



It may be desirable to prevent lysine catabolism in order to accumulate higher 
levels of free lysine and to prevent accumulation of lysine breakdown products 
such as saccharopine and oc-amino adipic acid. Evidence indicates that lysine is 
catabolized in plants via the saccharopine pathway. The first enzymatic evidence 
for the existence of this pathway was the detection of lysine ketoglutarate 
reductase (LKR) activity in immature endosperm of developing maize seeds 
[Arruda et al. (1982) Plant Physiol. (59:988-989]. LKR catalyzes the first step in 
lysine catabolism, the condensation of L-lysine with oc-ketoglutarate into 
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onset of endosperm development in com, reaches a peak level at about 20 days 
after pollination, and then declines [Arruda et al. (1983) Phytochemistry 
22:2687-2689]. In order to prevent the catabolism of lysine it would be desirable 
to reduce or eliminate LKR expression or activity. This could be accomplished by 
cloning the LKR gene, preparing a chimeric gene for cosuppression of LKR or 
preparing a chimeric gene to express antisense RNA for LKR, and introducing the 
chimeric gene into plants via transformation. Alternatively, plant mutants could be 
obtained wherein LKR enzyme activity is absent. 

Several methods to clone a plant LKR gene are available to one skilled in the 
art. The protein can be purified from corn endosperm, as described in Brochetto- 
Braga et al. [(1992) Plant Physiol. 98: 1 139-1 147] and used to raise antibodies. 
The antibodies can then be used to screen an cDNA expression library for LKR 
clones. Alternatively the purified protein can be used to determine amino acid 
sequence at the amino-terminal of the protein or from protease derived internal 
peptide fragments. Degenerate oligonucleotide probes can be prepared based upon 
the amino acid sequence and used to screen a plant cDNA or genomic DNA library 
via hybridization. 

Another method makes use of an E. coli strain that is unable to grow in a 
synthetic medium containing 20 |Xg/mL of L-ly sine. Expression of LKR full-length 
cDNA in this strain will reverse the growth inhibition by reducing the lysine 
concentration. Construction of a suitable E. coli strain and its use to select clones 
from a plant cDNA library that lead to lysine-resistant growth is described in 
Example 20. 

Yet another method relies upon homology between plant LKR and 
saccharopine dehydrogenase. Fungal saccharopine dehydrogenase (glutamate- 
fonning) and saccharopine dehydrogenase (lysine-foiming) catalyze the final two 
steps in the fungal lysine biosynthetic pathway. Plant LKR and fungal 
saccharopine dehydrogenase (lysine-forming) catalyze both forward and reverse 
reactions, use identical substrates and use similar co-factors. Similarly, plant 
saccharopine dehydrogenase (glutamate-forming), which catalyzes the second step 
in the lysine catabolic pathway, works in both forward and reverse reactions, uses 
identical substrates and uses similar co-factors as fungal saccharopine 
dehydrogenase (glutamate-forming). Several genes for fungal saccharopine 
dehydrogenases have been isolated and sequenced and are readily available to 
those skilled in the ait [Xuan et al. (1990) Mol. Cell. Biol. 7(7:4795-4806, Feller 
et al. (1994) MoL Cell. Biol. 14:641 1-6418]. These genes could be used as 
heterologous hybridization probes to identify plant LKR and plant saccharopine 
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dehydrogenase amate-forming) nucleic acid fragme r alternatively to 
identify homologous protein coding regions in plant cDNAs. 

Biochemical and genetic evidence derived from human and bovine studies 
has demonstrated that mammalian LKR and saccharopine dehydrogenase 
(glutamate-forming) enzyme activities are present on a single protein with a 
monomer molecular weight of about 1 17,000. This contrasts with the fungal 
enzymes which are carried on separate proteins, saccharopine dehydrogenase 
(lysine-formixig) with a molecular weight of about 44,000, and saccharopine 
dehydrogenase (glutamate-forming) with a molecular weight of about 51,000. 
Plant LKR has been reported to have a molecular weight of about 140,000 
indicating that it is like the animal catabolic protein wherein both LKR and 
saccharopine dehydrogenase (glutamate-forming) enzyme activities are present on 
a single protein. 

We provide two plant saccharopine dehydrogenase (glutamate-forming) 
nucleic acid fragments that contain cDNA derived from Arabidopsis thaliana. 
These were identified as cDNAs that encode proteins homologous to fungal 
saccharopine dehydrogenase (glutam ate -forming). These nucleic acid fragments 
can be used as hybridization probes to identify and isolate genomic DNA fragments 
or cDNA fragments encoding both LKR and saccharopine dehydrogenase 
(glutamate-fonning) enzyme activities from any plant desired. 

In order to block expression of the LKR gene in transformed plants, a 
chimeric gene designed for cosuppression of LKR can be constructed by linking 
the LKR gene or gene fragment to any of the plant promoter sequences described 
above. (See U.S. Patent No. 5,231,020 for methodology to block plant gene 
expression via cosuppression.) Alternatively, a chimeric gene designed to express 
antisense RNA for all or part of the LKR gene can be constructed by linking the 
LKR gene or gene fragment in reverse orientation to any of the plant promoter 
sequences described above. (See U.S. patent 5,107,065 for methodology to block 
plant gene expression via antisense RNA.) Either the cosuppression or antisense 
chimeric gene could be introduced into plants via transformation. Transfoimants 
wherein expression of the endogenous LKR gene is reduced or eliminated are then 
selected. 

Preferred promoters for the chimeric genes would be seed-specific 
promoters. For soybean, rapeseed and other dicotyledonous plants, strong seed- 
specific promoters from a bean phaseolin gene, a soybean P-conglycinin gene, 
glycinin gene, Kunitz trypsin inhibitor gene, or rapeseed napin gene would be 
preferred. For com and other monocotyledonous plants, a strong endosperm- 
specific promoter, e.g., the 10 kD or 27 kD zein promoter, would be preferred. 
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;enes can be 



obtained by the methods described above. In order to obtain transformed plants 
that express a chimeric gene for cosuppression of LKR or antisense LKR, as well 
as a chimeric genes encoding substantially lysine-insensitive DHDPS and AK, the 
cosuppression or antisense LKR gene could be linked to the chimeric genes 
encoding substantially lysine-insensitive DHDPS and AK and the three genes could 
be introduced into plants via transformation. Alternatively, the chimeric gene for 
cosuppression of LKR or antisense LKR could be introduced into previously 
transformed plants that express substantially lysine-insensitive DHDPS and AK, or 
the cosuppression or antisense LKR gene could be introduced into normal plants 
and the transfonnants obtained could be crossed with plants that express 
substantially lysine-insensitive DHDPS and AK. As another alternative, plant 
mutants that express no LKR enzyme activity could be crossed with plants that 
express substantially lysine-insensitive DHDPS and AK. 



It may be desirable to convert the high levels of lysine produced into a form 
that is insensitive to breakdown, e.g., by incorporating it into a di-, tri- or 
oligopeptide, or a lysine-rich storage protein. No natural lysine-rich proteins are 
known. 

One aspect of this invention is the design of polypeptides which can be 
expressed in vivo to serve as lysine-rich seed storage proteins. Polypeptides are 
linear polymers of amino acids where the cc-carboxyl group of one amino acid is 
covalently bound to the cc-amino group of the next amino acid in the chain. Non- 
covalent interactions among the residues in the chain and with the surrounding 
solvent determine the final conformation of the molecule. Those skilled in the art 
must consider electrostatic forces, hydrogen bonds, Van der Waals forces, 
hydrophobic interactions, and conformational preferences of individual amino acid 
residues in the design of a stable folded polypeptide chain [see for example: 
Creighton, (1984) Proteins, Structures and Molecular Properties, W. H. Freeman 
and Company, New York, pp 133-197, or Schulz et al., (1979) Principles of 
Protein Structure, Springer Verlag, New York, pp 27-45]. The number of 
interactions and their complexity suggest that the design process may be aided by 
the use of natural protein models where possible. 

The synthetic storage proteins (SSPs) embodied in this invention are chosen 
to be polypeptides with the potential to be enriched in lysine relative to average 
levels of proteins in plant seeds. Lysine is a charged amino acid at physiological 
pH and is therefore found most often on the surface of protein molecules [Chotia, 
(1976) Journal of Molecular Biology 105: 1-14]. To maximize lysine content, 
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Applicants cho; 




olecular shape with a high surface-t< 



ume ratio for the 



synthetic storage proteins embodied in this invention. The alternatives were either 
to stretch the common globular shape of most proteins to form a rod-like extended 
structure or to flatten the globular shape to a disk -like structure. Applicants chose 
5 the former configuration as there are several natural models for long rod-like 

proteins in the class of fibrous proteins [Creighton, (1984) Proteins, Structures and 
Molecular Properties, W.H. Freeman and Company, New York, p 191]. 

Coiled-coils constitute a well-studied subset of the class of fibrous proteins 
[see Cohen et al., (1986) Trends Biochem. ScL 77:245-248]. Natural examples are 
10 found in a-keratins, paramyosin, light meromyosin and tropomyosin. These 

protein molecules consist of two parallel alpha helices twisted about each other in a 
left-handed supercoil. The repeat distance of this supercoil is 140 A (compared to 
a repeat distance of 5.4 A for one turn of the individual helices). The supercoil 
causes a slight skew (10°) between the axes of the two individual alpha helices. 
15 In a coiled coil there are 3.5 residues per turn of the individual helices 

resulting in an exact 7 residue periodicity with respect to the supeihelix axis (see 
Figure 1). Every seventh amino acid in the polypeptide chain therefore occupies an 
equivalent position with respect to the helix axis. Applicants refer to the seven 
positions in this heptad unit of the invention as(defgabc)as shown in Figures 1 
and 2a. This conforms to the conventions used in the coiled-coil literature. 

The a and d amino acids of the heptad follow a 4,3 repeat pattern in the 
primary sequence and fall on one side of an individual alpha helix (See Figure 1). 
If the amino acids on one side of an alpha helix are all non-polar, that face of the 
helix is hydrophobic and will associate with other hydrophobic surfaces as, for 
example, the non-polar face of another similar helix. A coiled-coil structure results 
when two helices dimerize such that their hydrophobic faces are aligned with each 
other (See Figure 2a). 

The amino acids on the external faces of the component alpha helices (b, c, e, 
f, g) are usually polar in natural coiled-coils in accordance with the expected 
pattern of exposed and buried residue types in globular proteins [Schulz, et al., 
(1979) Principles of Protein Structure. Springer Verlag, New York, p 12; Talbot, 
et al , (1982) Acc. Chem. Res. 75:224-230; Hodges et al., (1981) Journal of 
Biological Chemistry 25(5:1214-1224]. Charged amino acids are sometimes found 
forming salt bridges between positions e and g' or positions g and e' on the 
opposing chain (see Figure 2a). 

Thus, two amphipathic helices like the one shown in Figure 1 are held 
together by a combination of hydrophobic interactions between the a, a', d, and d' 
residues and by salt bridges between e and g' and/or g and e ? residues. The packing 
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of the hydrophot^^sidues in the supercoil maintains the ^^ 1S "in register". For 
short polypeptides comprising only a few turns of the component alpha helical 
chains, the 10° skew between the helix axes can be ignored and the two chains 
treated as parallel (as shown in Figure 2a). 





5 



A number of synthetic coiled-coils have been reported in the literature (Lau 



et al., (1984) Journal of Biological Chemistry 259:13253-13261; Hodges et al., 
(1988) Peptide Research 7:19-30; DeGrado et al., (1989) Science 245:622-628; 
O'Neil et al., (1990) Science 250:646-651]. Although these polypeptides vary in 
size, Lau et al. found that 29 amino acids were sufficient for dimerization to form 
10 the coiled-coil structure [Lau et al., (1984) Journal of Biological Chemistry 

259: 13253-13261]. Applicants constructed the polypeptides in this invention as 
28-residue and larger chains for reasons of conformational stability. 

The polypeptides of this invention are designed to dimerize with a coiled-coil 
motif in aqueous environments. Applicants have used a combination of 
15 hydrophobic interactions and electrostatic interactions to stabilize the coiled-coil 
conformation. Most nonpolar residues are restricted to the a and d positions which 
creates a hydrophobic stripe parallel to the axis of the helix. This is the 
dimerization face. Applicants avoided large, bulky amino acids along this face to 
minimize steric interference with dimerization and to facilitate formation of the 
20 stable coiled-coil structure. 

Despite recent reports in the literature suggesting that methionine at 
positions a and d is destabilizing to coiled-coils in the leucine zipper subgroup 
[Landschulz et al., (1989) Science 245:1681-1688 and Hu et al., (1990) Science 
250:1400-1403], Applicants chose to substitute methionine residues for leucine on 
the hydrophobic face of the SSP polypeptides. Methionine and leucine are similar 
in molecular shape (Figure 3). Applicants demonstrated that any destabilization of 
the coiled-coil that may be caused by methionine in the hydrophobic core appears 
to be compensated in sequences where the formation of salt bridges (e-g 1 and g-e f ) 
occurs at all possible positions in the helix (i.e., twice per heptad). 

To the extent that it is compatible with the goal of creating a polypeptide 
enriched in lysine, Applicants minimized the unbalanced charges in the polypeptide. 
This may help to prevent undesirable interactions between the synthetic storage 
proteins and other plant proteins when the polypeptides are expressed in vivo. 

The polypeptides of this invention are designed to spontaneously fold into a 
defined, conformationally stable structure, the alpha helical coiled-coil, with 
minimal restrictions on the primary sequence. This allows synthetic storage 
proteins to be custom -tailored for specific end-user requirements. Any amino acid 
can be incorporated at a frequency of up to one in every seven residues using the b, 
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c, and f position^^he heptad repeat unit. Applicants ndj^^at up to 43% of an 

essential amino acid from the group isoleucine, leucine, lysine, methionine, 
threonine, and valine can be incorporated and that up to 14% of the essential amino 
acids from the group phenylalanine, tryptophan, and tyrosine can be incorporated 
into the synthetic storage proteins of this invention. 

In the SSPs only Met, Leu, He, Val or Thr are located in the hydrophobic 
core. Furthermore, the e, g, e', and g positions in the SSPs are restricted such that 
an attractive electrostatic interaction always occurs at these positions between the 
two polypeptide chains in an SSP dimer. This makes the SSP polypeptides more 
stable as dimers. 

Thus, the novel synthetic storage proteins described in this invention 
represent a particular subset of possible coiled-coil polypeptides. Not all 
polypeptides which adopt an amphipathic alpha helical conformation in aqueous 
solution are suitable for the applications described here. 

The following rules derived from Applicants' work define the SSP 
polypeptides that Applicants use in their invention: 

The synthetic polypeptide comprises n heptad units (d e f g a b c), each 
heptad being either the same or different, wherein: 
n is at least 4; 

a and d are independently selected from the group consisting of 

Met, Leu, Val, lie and Thr; 
e and g are independently selected from the group consisting of the 
acid/base pairs Glu/Lys, Lys/Glu, Arg/Glu, Arg/Asp, Lys/Asp, 
Glu/Arg, Asp//Arg and Asp/Lys; and 
b, c and f are independently any amino acids except Gly or Pro and 
at least two amino acids of b, c and f in each heptad are 
selected from the group consisting of Glu, Lys, Asp, Arg, His, 
Thr, Ser, Asn, Gin, Cys and Ala. 
Chimeric Gene s Encoding: Lvsine-Rich Polypeptides 
DNA sequences which encode the polypeptides described above can be 
designed based upon the genetic code. Where multiple codons exist for particular 
amino acids, codons should be chosen from those preferable for translation in 
plants. Oligonucleotides corresponding to these DNA sequences can be 
synthesized using an ABI DNA synthesizer, annealed with oligonucleotides 
corresponding to the complementary strand and inserted into a plasmid vector by 
methods known to those skilled in the art. The encoded polypeptide sequences can 
be lengthened by inserting additional annealed oligonucleotides at restriction 
endonuclease sites engineered into the synthetic gene. Some representative 
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strategies for con ting genes encoding lysine-rich poly^^des of the invention, 
as well as DNA and amino acid sequences of preferred embodiments are provided 
in Example 21. 

A chimeric gene designed to express RNA for a synthetic storage protein 
5 gene encoding a lysine-rich polypeptide can be constructed by linking the gene to 
any of the plant promoter sequences described above. Preferred promoters would 
be seed-specific promoters. For soybean, rapeseed and other dicotyledonous 
plants strong seed-specific promoters from a bean phaseolin gene, a soybean 
p-conglycinin gene, glycinin gene, Kunitz trypsin inhibitor gene, or rapeseed napin 
10 gene would be preferred. For corn or other monocotyledonous plants, a strong 
endosperm-specific promoter, e.g., the 10 kD or 27 kD zein promoter, or a strong 
embyro-specific promoter, e.g., the com globulin 1 promoter, would be preferred. 

In order to obtain plants that express a chimeric gene for a synthetic storage 
protein gene encoding a lysine-rich polypeptide, plants can be transformed by any 
15 of the methods described above. In order to obtain plants that express both a 
chimeric SSP gene and chimeric genes encoding substantially lysine-insensitive 
DHDPS and AK, the SSP gene could be linked to the chimeric genes encoding 
substantially lysine-insensitive DHDPS and AK and the three genes could be 
introduced into plants via transformation. Alternatively, the chimeric SSP gene 
could be introduced into previously transformed plants that express substantially 
lysine-insensitive DHDPS and AK, or the SSP gene could be introduced into 
normal plants and the transfonnants obtained could be crossed with plants that 
express substantially lysine-insensitive DHDPS and AK. 

Results from genetic crosses of transformed plants containing lysine 
biosynthesis genes with transformed plants containing lysine-rich protein genes (see 
Example 23) demonstrate that the total lysine levels in seeds can be increased by 
the coordinate expression of these genes. This result was especially striking 
because the gene copy number of all of the transgenes was reduced in the hybrid. 
It is expected that the lysine level would be further increased if the biosynthesis 
genes and the lysine-rich protein genes were all homozygous. 

Use of the cts/lysC-M4 Chimeric Gene as a 
Selectable Marker for Plant Transformation 
Growth of cell cultures and seedlings of many plants is inhibited by high 
concentrations of lysine plus threonine. Growth is restored by addition of 
methionine (or homoserine which is converted to methionine in vivo). Lysine plus 
threonine inhibition is thought to result from feedback inhibition of endogenous 
AK, which reduces flux through the pathway leading to starvation for methionine. 
In tobacco there are two AK enzymes in leaves, one ly sine-sensitive and one 
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threonine sensit^fc^egrutui et al. (1984) Theor. AppL 65:11-20]. High 

concentrations oflysine plus threonine inhibit growth of shoots from tobacco leaf 
disks and inhibition is reversed by addition of low concentrations of methionine. 
Thus, growth inhibition is presumably due to inhibition of the two AK isozymes. 

Expression of active lysine and threonine insensitive AKIH-M4 also reverses 
lysine plus threonine growth inhibition (Table 2, Example 7). There is a good 
correlation between the level of AKEH-M4 protein expressed and the resistance to 
lysine plus threonine. Expression of lysine-sensitive wild type AKHI does not have 
a similar effect. Since expression of the AKIII-M4 protein permits growth under 
normally inhibitory conditions, a chimeric gene that causes expression of 
AKTTT-M4 in plants can be used as a selectable genetic marker for transformation 
as illustrated in Examples 13 and 17. 

EXAMPLES 

The present invention is further defined in the following Examples, in which 
all parts and percentages are by weight and degrees are Celsius, unless otherwise 
stated. It should be understood that these Examples, while indicating preferred 
embodiments of the invention, are given by way of illustration only. From the 
above discussion and these Examples, one skilled in the art can ascertain the 
essential characteristics of this invention, and without departing from the spirit and 
scope thereof, can make various changes and modifications of the invention to 
adapt it to various usages and conditions. 

EXAMPLE 1 
Isolation of the E. coli lvsC Gene and mutations 
in lysC resulting in lysine-insensitive AKm 
The E. coli lysC gene has been cloned, restriction endonuclease mapped and 
sequenced previously [Cassan et al. (1986) /. BioL Chem. 2(51:1052-1057]. For 
the present invention the lysC gene was obtained on a bacteriophage lambda clone 
from an ordered library of 3400 overlapping segments of cloned E. coli DNA 
constructed by Kohara, Akiyama and Isono [Kohara et al. (1987) Cell 
50:595-508]. This library provides a physical map of the whole E. coli 
chromosome and ties the physical map to the genetic map. From the knowledge of 
the map position of lysC at 90 min on the E. coli genetic map [Theze et al. (1974) 
/. Bacteriol. 117:133-143], the restriction endonuclease map of the cloned gene 
[Cassan et al. (1986)/. BioL Chem. 267:1052-1057], and the restriction 
endonuclease map of the cloned DNA fragments in the E. coli library [Kohara 
et al. (1987) Cell 50:595-508], it was possible to choose lambda phages 4E5 and 
7A4 [Kohara et al. (1987) Cell 50:595-508] as likely candidates for carrying the 
lvsC gene. The phages were grown in liquid culture from single plaques as 
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described [see C^Jnt Protocols in Molecular Biology (1 Ausubel et al. Eds. 
John Wiley & Sons New York] using LE392 as host [see Sam brook et al. (1989) 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press]. 
Phage DNA was prepared by phenol extraction as described [see Current Protocols 
in Molecular Biology (1987) Ausubel et al. eds. John WUey & Sons New York]. 

From the sequence of the gene several restriction endonuclease fragments 
diagnostic for the lvsC gene were predicted, including an 1860 bp EcoR I-Nhe I 
fragment, a 2140 bp EcoR I-Xmn I fragment and a 1600 bp EcoR I-BamH I 
fragment. Each of these fragments was detected in both of the phage DNAs 
confirming that these carried the lvsC gene. The EcoR I-Nhe I fragment was 
isolated and subcloned in plasmid pBR322 digested with the same enzymes, 
yielding an ampicillin-resistant, tetracycline-sensitive E. coli transform ant. The 
plasmid was designated pBT436. 

To establish that the cloned lvsC gene was functional, pBT436 was 
transformed into E. coli strain Gif 106M1 (E. coli Genetic Stock Center strain 
CGSC-5074) which has mutations in each of the three E. coli AK genes [Theze 
et al. (1974) /. BacterioL 7/7:133-143]. This strain lacks all AK activity and 
therefore requires diaminopimelate (a precursor to lysine which is also essential for 
cell wall biosynthesis), threonine and methionine. In the transformed strain all 
these nutritional requirements were relieved demonstrating that the cloned lygC 
gene encoded functional AKHL 

Addition of lysine (or diaminopimelate which is readily converted to lysine in 
vivo) at a concentration of approximately 0.2mM to the growth medium inhibits 
the growth of Gifl06Ml transformed with pBT436. M9 media [see Sambrook 
et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory Press] supplemented with the arginine and isoleucine, required for 
Gifl06Ml growth, and ampicillin, to maintain selection for the pBT436 plasmid, 
was used. This inhibition is reversed by addition of threonine plus methionine to 
the growth media. These results indicated that AKDI could be inhibited by 
exogenously added lysine leading to starvation for the other amino acids derived 
from aspartate. This property of pBT436-transformed Gifl06Ml was used to 
select for mutations in lvsC that encoded lysine-insensitive AKHL 

Single colonies of Gifl06Ml transformed with pBT436 were picked and 
resuspended in 200 |XL of a mixture of 100 [iL 1 % lysine plus 100 ^L of M9 
media. The entire cell suspension containing 10 7 -10 8 cells was spread on a petri 
dish containing M9 media supplemented with the arginine, isoleucine, and 
ampicillin. Sixteen petri dishes were thus prepared. From 1 to 20 colonies 
appeared on 11 of the 16 petri dishes. One or two (if available) colonies were 
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were obtained. Plasmid DNA was prepared from eight of these and re-transfonned 
into Gif 106M1 to determine whether the lysine resistance determinant was 
plasmid-bome. Six of the eight plasmid DNAs yielded lysine-resistant colonies. 
Three of these six carried lysC genes encoding AKHI that was uninhibited by 
15 mM lysine, whereas wild type AKHI is 50% inhibited by 0.3-0.4 mM lysine and 
>90% inhibited by 1 mM lysine (see Example 2 for details). 

To determine the molecular basis for lysine -resistance the sequences of the 
wild type lysC gene and three mutant genes were determined. A method for 
"Using mini-prep plasmid DNA for sequencing double stranded templates with 
sequenase™" (Kraft et al. (1988) BioTechniques (5:544-545] was used. 
Oligonucleotide primers, based on the published lysC sequence and spaced 
approximately every 200 bp, were synthesized to facilitate the sequencing. The 
sequence of the wild type lvsC gene cloned in pBT436 (SEQ ID NO:l) differed 
from the published lvsC sequence in the coding region at 5 positions. Four of these 
nucleotide differences were at the third position in a codon and would not result in 
a change in the amino acid sequence of the AKHI protein. One of the differences 
would result in a cysteine to glycine substitution at amino acid 58 of AKHI. These 
differences are probably due to the different strains from which the lysC genes 
were cloned. 

The sequences of the three mutant lysC genes that encoded lysine-insensitive 
AK each differed from the wild type sequence by a single nucleotide, resulting in a 
single amino acid substitution in the protein. Mutant M2 had an A substituted for a 
G at nucleotide 954 of SEQ ID NO: 1 resulting in an isoleucine for methionine 
substitution at amino acid 318 and mutants M3 and M4 had identical T for C 



substitutions at nucleotide 1055 of SEQ ID NO: 1 resulting in an ^eucin^ for 
threonine substitution at amino acid 352. Thus, either of these single amino acid 



High level expression of wild type and mutant lvsC ge nes in E. coli 
An Nco I (CCATGG) site was inserted at the translation initiation codon of 
the lysC gene using the following oligonucleotides: 

SEQ ID NO:2: 

GATCCATGGC TGAAATTGTT GTCTCCAAAT TTGGCG 
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SEQ ID NO:3: 

GTACCGCCAA ATTTGGAGAC AACAATTTCA GCCATG 



When annealled these oligonucleotides have BamH I and Asp718 "sticky" ends. 
The plasmid pBT436 was digested with BamH I, which cuts upstream of the lysC 
coding sequence and Asp718 which cuts 31 nucleotides downstream of the 
initiation codon. The annealled oligonucleotides were ligated to the plasmid vector 
and £. coli transform ants were obtained. Plasmid DNA was prepared and 
screened for insertion of the oligonucleotides based on the presence of an Nco I 
site. A plasmid containing the site was sequenced to assure that the insertion was 
correct, and was designated pBT457. In addition to creating an Nco I site at the 
initiation codon of lvsC . this oligonucleotide insertion changed the second codon 
from TCT, coding for serine, to GCT, coding for alanine. This amino acid 
substitution has no apparent effect on the AKHI enzyme activity. 

To achieve high level expression of the IvsC genes in £. coli, the bacterial 
expression vector pBT430 was used. This vector is a derivative of pET-3a 
[Rosenberg et al. (1987) Gene 5(5:125-135] which employs the bacteriophage T7 
RNA polymerase/17 promoter system. Plasmid pBT430 was constructed by first 
destroying the EcoR I and Hind HI sites in pET-3a at their original positions. An 
oligonucleotide adaptor containing EcoR I and Hind m sites was inserted at the 
BamH I site of pET-3a. This created pET-3aM with additional unique cloning 
sites for insertion of genes into the expression vector. Then, the Nde I site at the 
position of translation initiation was converted to an Nco I site using 
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this 
region, 5- CATATG G. was converted to 5-C CCATGG in pBT430. 

The lysC gene was cut out of plasmid pBT457 as a 1560 bp Nco I-EcoR I 
fragment and inserted into the expression vector pBT430 digested with the same 
enzymes, yielding plasmid pBT461 . For expression of the mutant lysC genes (M2, 
M3 and M4) pBT461 was digested with Kpn I-EcoR I, which removes the wild 
type lysC gene from about 30 nucleotides downstream from the translation start 
codon, and inserting the homologous Kpn I-EcoR I fragments from the mutant 
genes yielding plasmids pBT490, pBT491 and pBT492, respectively. 

For high level expression each of the plasmids was transformed into E. coli 
strain BL21(DE3) [Studier et al. (1986)/. Mol. Biol. 759:113-130]. Cultures 
were grown in LB medium containing ampicillin (100 mg/L) at 25°C. At an 
optical density at 600 nm of approximately 1, EPTG (isopropylthio-p-galactoside, 
the inducer) was added to a final concentration of 0.4 mM and incubation was 
continued for 3 h at 25 °C. The cells were collected by centrifugation and 
resuspended in l/20th (or 1/1 00th) the original culture volume in 50 mM NaCl; 
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50 mM Tris-ClJ^7.5; 1 mM EDTA, and frozen at -20^HTOzen aliquots of 
1 mL were thawed at 37°C and sonicated, in an ice-water bath, to lyse the cells. 
The lysate was centrifuged at 4°C for 5 min at 15,000 rpm. The supernatant was 
removed and the pellet was resuspended in 1 mL of the above buffer. 

The supernatant and pellet fractions of uninduced and IPTG-induced cultures 
of BL21(DE3)/fcBT461 were analyzed by SDS polyacrylamide gel electrophoresis. 
The major protein visible by Coomassie blue staining in the supernatant of the 
induced culture had a molecular weight of about 48 kd, the expected size for 
AKTTT About 80% of the AKHI protein was in the supernatant and AKm 
represented 10-20% of the total E. coli protein in the extract. 

AK activity was assayed as shown below: 
Assay mix (for 12 assay tubes): 
4.5mLH 2 0 
1.0mL8MKOH 
1.0 mL 8MNH 2 OH-HCl 
1.0 mL 1M Tris-HCl pH 8.0 
0.5 mL 0.2M ATP (121 mg/mL in 0.2M NaOH) 
50^LlMMgSO 4 

Each 1.5 mL eppendorf assay tube contained: 
0.64 mL assay mix 

0.04 mL 0.2 M L-aspartic acid or 0.04 mL H2O 
0.0005-0.12 mL extract 
H2O to total volume 0.8 mL 

Assay tubes were incubated at 30°C for desired time (10-60 min). Then 
0.4 mL FeCl3 reagent (10% w/v FeCl3, 33% trichloroacetic acid, 0.7 M HQ) was 
added and the material centrifuged for 2 min in an eppendorf centrifuge. The 
supernatant was decanted. The OD was read at 540 nm and compared to the 
aspartyl-hydroxamate standard. 

Approximately 80% of the AKHI activity was in the supernatant fraction. 
The specific activity of wild type and mutant crude extracts was 5-7 product 
per min per milligram total protein. Wild type AKIH was sensitive to the presence 
of L-lysine in the assay. Fifty percent inhibition was found at a concentration of 
about 0.4 mM and 90% inhibition at about 1.0 mM. In contrast, mutants 
AKHI-M2, M3 and M4 (see Example 1) were not inhibited at all by 15 mM 
L-lysine. 
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Wild type^Bn protein was purified from the supt^^^nt of the IPTG- 
induced culture as follows. To 1 mL of extract, 0.25 mL of 10% streptomycin 
sulfate was added and kept at 4°C overnight. The mixture was centrifuged at 4°C 
for 15 min at 15,000 rpm. The supernatant was collected and desalted using a 
Sephadex G-25 M column (Column PD-10, Pharmacia). It was then run on a 
Mono-Q HPLC column and eluted with a 0-1M NaCl gradient. The two 1 mL 
fractions containing most of the AKm activity were pooled, concentrated, desalted 
and run on an HPLC sizing column (TSK G3000SW). Fractions were eluted in 
20 mM KPO4 buffer, pH7.2, 2 mM MgS04, 10 mM 0-mercaptoethanol, 0,15M 
KQ, 0.5 mM L-lysine and were found to be >95% pure by SDS polyacrylamide gel 
electrophoresis. Purified AKHI protein was sent to Hazelton Research Facility 
(310 Swampridge Road, Denver, PA 17517) to have rabbit antibodies raised 
against the protein. 

EXAMPLE 3 

Isolation of the E. coli and Corvnebacterium glutamicum dapA genes 
The E. coli dapA gene (eco dapA ) has been cloned, restriction endonuclease 
mapped and sequenced previously [Richaud et al. (1986) /. BacterioL 
7(5(5:297-300]. For the present invention the dapA gene was obtained on a 
bacteriophage lambda clone from an ordered library of 3400 overlapping segments 
of cloned E. coli DNA constructed by Kohara, Akiyama and Isono [Kohara et al. 
(1987) Cell 50:595-508, see Example 1]. From the knowledge of the map position 
of dap A at 53 min on the E. coli genetic map [Bachman (1983) Microbiol. Rev. 
47:180-230], the restriction endonuclease map of the cloned gene [Richaud et al. 
(1986) /. BacterioL 7(5(5:297-300], and the restriction endonuclease map of the 
cloned DNA fragments in the E. coli library [Kohara et al. (1987) Cell 
50:595-508], it was possible to choose lambda phages 4C1 1 and 5A8 [Kohara 
et al. (1987) Cell 50:595-508] as likely candidates for carrying the dapA gene. 
The phages were grown in liquid culture from single plaques as described [see 
Current Protocols in Molecular Biology (1987) Ausubel et al. eds., John Wiley & 
Sons New York] using LE392 as host [see Sambrook et al. (1989) Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press]. Phage 
DNA was prepared by phenol extraction as described [see Current Protocols in 
Molecular Biology (1987) Ausubel et al. eds., John Wiley & Sons New York]. 
Both phages contained an approximately 2.8 kb Pst I DNA fragment expected for 
the dapA gene [Richaud et al. (1986) /. BacterioL 7(5(5:297-300]. The fragment 
was isolated from the digest of phage 5A8 and inserted into Pst I digested vector 
pBR322 yielding plasmid pBT427. 
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cerium dapA gene (co rdapA ) was is< 




from 



genomic 



DNA from ATCC strain 13032 using polymerase chain reaction (PCR). The 
nucleotide sequence of the Corynebacterium dapA gene has been published 
[Bonnassie et al. (1990) Nucleic Acids Res, 75:6421]. From the sequence it was 
5 possible to design oligonucleotide primers for PCR that would allow amplification 
of a DNA fragment containing the gene, and at the same time add unique 
restriction endonuclease sites at the start codon (Nco I) and just past the stop 
codon (EcoR I) of the gene. The oligonucleotide primers used were: 

10 SEQ ID NO:4: 

CCCGGGCCAT GGCTACAGGT TTAACAGCTA AGACCGGAGT AGAGCACT 
SEQ ID NO:5: 

GATATCGAAT TCTCATTATA GAACTCCAGC TTTTTTC 

15 

PCR was performed using a Perkin-Elmer Cetus kit according to the 
instructions of the vendor on a thermocycler manufactured by the same company. 
The reaction product, when run on an agarose gel and stained with ethidium 
bromide, showed a strong DNA band of the size expected for the Corynebacterium 
dapA gene, about 900 bp. The PCR-generated fragment was digested with 
restriction endonucleases Nco I and EcoR I and inserted into expression vector 
pBT430 (see Example 2) digested with the same enzymes. In addition to 
introducing an Nco I site at the translation start codon, the PCR primers also 
resulted in a change of the second codon from AGC coding for serine to GCT 
coding for alanine. Several clones that expressed active, lysine-insensitive DHDPS 
(see Example 4) were isolated, indicating that the second codon amino acid 
substitution did not affect activity; one clone was designated FS766. 

The Nco I to EcoR I fragment carrying the PCR-generated Corynebacterium 
dapA gene was subcloned into the phagemid vector pGEM-9Zf(-) from Promega, 
single-stranded DNA was prepared and sequenced. This sequence is shown in 
SEQ ID NO:6. 

Aside from the differences in the second codon already mentioned, the 
sequence matched the published sequence except at two positions, nucleotides 798 
and 799. In the published sequence these are TC, while in the gene shown in SEQ 
ID NO: 6 they are CT. This change results in an amino acid substitution of leucine 
for serine. The reason for this difference is not known. It may be due to an error 
in the published sequence, the difference in strams used to isolate the gene, or a 
PCR-generated error. The latter seems unlikely since the same change was 
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observed in at I^BB independently isolated PCR-; 




gener; 




lapA genes. The 



difference has no apparent effect on DHDPS enzyme activity (see Example 4). 



An Nco I (CCATGG) site was inserted at the translation initiation codon of 
the E. coli dapA gene using oligonucleotide-directed mutagenesis. The 2.8 kb 
Pst I DNA fragment carrying the dapA gene in plasmid pBT427 (see Example 3) 
was inserted into the Pst I site of phagemid vector pTZ18R (Pharmacia) yielding 
pBT43 1 . The orientation of the dapA gene was such that the coding strand would 
be present on the single-stranded phagemid DNA. Oligonucleotide-directed 
mutagenesis was carried out using a Muta-Gene kit from Bio-Rad according to the 
manufacturer's protocol with the mutagenic primer shown below: 

SEQ ID NO:7: 

CTTCCCGTGA CCATGGGCCA TC 

Putative mutants were screened for the presence of an Nco I site and a plasmid, 
designated pBT437, was shown to have the the proper sequence in the vicinity of 
the mutation by DNA sequencing. The addition of an Nco I site at the translation 
start codon also resulted in a change of the second codon from TTC coding for 
phenylalanine to GTC coding for valine. 

To achieve high level expression of the dapA genes in E. coli the bacterial 
expression vector pBT430 (see Example 2) was used. The E. coli dapA gene was 
cut out of plasmid pBT437 as an 1 150 bp Nco I-Hind III fragment and inserted 
into the expression vector pBT430 digested with the same enzymes 9 yielding 
plasmid pBT442. For expression of the Corynebacterium dapA gene, the 910 bp 
Nco I to EcoR I fragment of SEQ ID NO:6 inserted in pBT430 (pFS766, see 
Example 3) was used. 

For high level expression each of the plasmids was transformed into E. coli 
strain BL21(DE3) [Studier et al. (1986)/. MoL Biol. i<59:113-130]. Cultures 
were grown in LB medium containing ampicillin (100 mg/L) at 25°C. At an 
optical density at 600 nm of approximately 1, IPTG (isopropylthio-j3-galactoside, 
the inducer) was added to a final concentration of 0.4 mM and incubation was 
continued for 3 h at 25°C. The cells were collected by centrifugation and 
resuspended in l/20th (or 1/1 00th) the original culture volume in 50 mM NaCl; 
50 mM Tris-Cl, pH 7.5; 1 mM EDTA, and frozen at -20°C Frozen aliquots of 
1 mL were thawed at 37°C and sonicated, in an ice-water bath, to lyse the cells. 



EXAMPLE 4 



High level expression of the E. coli and 
Corynebacterium jdutamicum dapA genes in E, coli 



The lysate was 




ged at 4°C for 5 



mui at 



15,000 rp 




te supernatant was 



removed and the pellet was resuspended in 1 mL of the above buffer. 

The supernatant and pellet fractions of uninduced and IPTG-induced cultures 
of BL21(DE3)/pBT442 or BL21(DE3)/pFS766 were analyzed by SDS 
polyacrylamide gel electrophoresis. The major protein visible by Coomassie blue 
staining in the supernatant and pellet fractions of both induced cultures had a 
molecular weight of 32-34 kd, the expected size for DHDPS. Even in the 
uninduced cultures this protein was the most prominent protein produced. 

In the BL21(DE3)/pBT442 IPTG-induced culture about 80% of the DHDPS 
protein was in the supernatant and DHDPS represented 10-20% of the total 
protein in the extract. In the BL21(DE3)/pFS766 IPTG-induced culture more than 
50% of the DHDPS protein was in the pellet fraction. The pellet fractions in both 
cases were 90-95% pure DHDPS, with no other single protein present in 
significant amounts. Thus, these fractions were pure enough for use in the 
generation of antibodies. The pellet fractions containing 2-4 mg of either E. coli 
DHDPS or Corynebacterium DHDPS were solubilized in 50 mM NaQ; 50 mM 
Tris-Cl, pH 7.5; 1 mM EDTA, 0.2 mM dithiothreitol, 0.2% SDS and sent to 
Hazelton Research Facility (310 Swampridge Road, Denver, PA 17517) to have 
rabbit antibodies raised against the proteins. 

DHDPS enzyme activity was assayed as follows: 
Assay mix (for 10 X 1.0 mL assay tubes or 40 X 0.25 mL for microtiter dish); 
made fresh, just before use: 
2.5 mL H2O 
0.5 mL 1.0 M Tris-HQ pH8.0 

0.5 mL 0.1 M Na Pyruvate 

0.5 mL o-Aminobenzaldehyde (lOmg/mL in ethanol) 

25 pL 1.0M DL-Aspartic-P-semialdehyde (ASA) in 1.0N 



HQ 



Assay (1.0 mL): 



MicroAssay (0.25mL): 



DHDPS assay mix 
enzyme extract + H2O; 
10 mM L-lysine 



0.40 mL 



0.10 mL 



0.10 mL 



.025 mL 



5 \xL or 20 uL 



1 uL or 5 uL 



Incubate at 30°C for desired time. Stop by addition of: 




0.125 mL 
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Color allowed v 




elop for 30-60 min. Precipitate spi 




in eppendorf 



centrifuge. OD^^s 0 min read as blank. For MicroAssay^liquot 0.2 mL into 
micro titer well and read at OD530. 

The specific activity of E. coli DHDPS in the supernatant fraction of induced 
extracts was about 50 OD540 units per minute per milligram protein in a 1.0 mL 
assay. E. coli DHDPS was sensitive to the presence of L-lysine in the assay. Fifty 
percent inhibition was found at a concentration of about 0.5 raM. For 
Corynebacterium DHDPS, the activity was measured in the supernatant fraction of 
uninduced extracts, rather than induced extracts. Enzyme activity was about 4 
OD530 units per min per milligram protein in a 0.25 mL assay. In contrast to 
E. coli DHDPS, Corynebacterium DHDPS was not inhibited at all by L-lysine, 
even at a concentration of 70 mM. 



Excretion of amino acids bv E. coli expressing high levels of DH DPS and/or AKHI 

The E. coli expression cassette with the E. coli dapA gene linked to the T7 
RNA polymerase promoter was isolated by digesting pBT442 (see Example 4) 
with Bgl II and BamH I separating the digestion products via agarose gel 
electrophoresis and eluting the approximately 1250 bp fragment from the gel. This 
fragment was inserted into the BamH I site of plasmids pBT461 (containing the T7 
promoter/ly£ gene) and pBT492 (containing the T7 promoter/ly§C-M4 gene). 
Inserts where transcription of both genes would be in the same direction were 
identified by restriction endonuclease analysis yielding plasmids pBT517 (T7/d^A 
+ T7/lvsC-M4) and pBT519 (T7 /dapA + T7/lysC). 

In order to induce E. coli to produce and excrete amino acids, these 
plasmids, as well as plasmids pBT442, pBT461 and pBT492 (and pBR322 as a 
control) were transformed into E. coli strain BL21(DE3) [Studier et al. (1986)/. 
MoL Biol. 759:113-130]. All of these plasmids, but especially pBT517 and 
pBT519, are somewhat unstable in this host strain, necessitating careful 
maintenance of selection for ampicillin resistance during growth. 

All strains were grown in minimal salts M9 media [see Sambrook et al. 
(1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory 
Press] supplemented with ampicillin to maintain selection for the plasmids 
overnight at 37°C. Cultures were collected when they reached an ODgoo of 1. 
Cells were removed by centrifiigation and the supernatants (3 mL) were passed 
through 0.2 micron filters to remove remaining cells and large molecules. Five 
microliter aliquots of the supernatant fractions were analyzed for amino acid 
composition with a Beckman Model 6300 amino acid analyzer using post-column 
ninhydrin detection. Results are shown in Table 1. 
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TABLE 1 

Amino Acid Concentration in Culture Supernatant^ [mM] 



Plasmid 


Lvs 


m 


Met 


Ala 


YaJ 


Asp 


Glu 


pBR322 


0 


0 


0 


0.05 


0.1 


0 


0 


pBT442 


0.48 


0 


0 


0.04 


0.06 


0 


0 


pBT461 


0.14 


0.05 


0 


0.02 


0.03 


0 


0 


pBT492 


0.16 


0.07 


0 


0.02 


0.03 


0 


0 


pBT517 


0.18 


0 


0.01 


0 


0 


0.02 


0.02 


pBT519 


0.14 


0 


0.01 


0 


0 


0.01 


0 



All of the plasmids, except the pBR322 control, lead to the excretion of 
lysine into the culture medium. Expression of the lvsC or the lvsC-M4 gene lead to 
both lysine and threonine excretion. Expression of lysC -M4 + dapA lead to 
excretion of lysine, methionine, aspartic acid and glutamic acid, but not threonine. 
In addition, alanine and valine were not detected in the culture supernatant 
Similar results were obtained with lvsC + dapA . except that no glutamic acid was 
excreted. 

EXAMPLE 6 

Construction of Chimeric dapA. lvsC and lysC-M4 Genes for Expression in Plants 

Several gene expression cassettes were used for construction of chimeric 
genes for expression of eco dapA . co rdapA . lvsC and lysC -M4 in plants. A leaf 
expression cassette (Figure 4a) is composed of the 35S promoter of cauliflower 
mosaic virus [Odell et al.(1985) Nature 31 J:810-812; Hull et al. (1987) Virology 
5(5:482-493], the translation leader from the chlorophyll a/b binding protein (Cab) 
gene, [Dunsmuir (1985) Nucleic Acids Res. 75:2503-2518] and 3' transcription 
termination region from the nopaline synthase (Nos) gene [Depicker et al. (1982) 
7. Mol. AppL Genet. 7:561-570]. Between the 5' and 3' regions are the restriction 
endonuclease sites Nco I (which includes the ATG translation initiation codon), 
EcoR I, Sma I and Kpn I. The entire cassette is flanked by Sal I sites; there is also 
a BamH I site upstream of the cassette. 

A seed-specific expression cassette (Figure 4b) is composed of the promoter 
and transcription terminator from the gene encoding the J3 subunit of the seed 
storage protein phaseolin from the bean Phaseohts vulgaris [Doyle et al. (1986) J. 
Biol. Chem. 2(57:9228-9238]. The phaseolin cassette includes about 500 
nucleotides upstream (5 f ) from the translation initiation codon and about 1650 
nucleotides downstream (3') from the translation stop codon of phaseolin. 
Between the 5' and 3' regions are the unique restriction endonuclease sites Nco I 
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(which include; 



ATG translation initiation c 



odon), Si^^BKpn I and Xba I. 




The entire cassette is flanked by Hind III sites. 

A second seed expression cassette was used for the co rdapA gene. This was 
composed of the promoter and transcription terminator from the soybean Kunitz 
trysin inhibitor 3 (KTI3) gene [Jofuku et al. (1989) Plant Cell 7:427-435]. The 
KTI3 cassette includes about 2000 nucleotides upstream (5') from the translation 
initiation codon and about 240 nucleotides downstream (3') from the translation 
stop codon of phaseolin. Between the 5' and 3* regions are the unique restriction 
endonuclease sites Nco I (which includes the ATG translation initiation codon), 
Xba I, Kpn I and Sma I. The entire cassette is flanked by BamH I sites. 

A constitutive expression cassette for corn was used for expression of the 
lvsC -M4 gene and the eco dapA gene. It was composed of a chimeric promoter 
derived from pieces of two com promoters and modified by in vitro site-specific 
mutagenesis to yield a high level constitutive promoter and a 3' region from a com 
gene of unknown function. Between the 5* and 3' regions are the unique restriction 
endonuclease sites Nco I (which includes the ATG translation initiation codon), 
Sma I and Bgl XI. The nucleotide sequence of the constitutive com expression 
cassette is shown in SEQ ID NO:93. 

Plant amino acid biosynthetic enzymes are known to be localized in the 
chloroplasts and therefore are synthesized with a chloroplast targeting signal. 
Bacterial proteins such as DHDPS and AKIII have no such signal. A chloroplast 
transit sequence (cts) was therefore fused to the ecodagA, co rdapA . lysC . and 
lvsC-M4 coding sequence in some chimeric genes. The cts used was based on the 
the cts of the small subunit of ribulose 1,5-bisphosphate carboxylase from soybean 
[Berry-Lowe et al. (1982) /. hfol AppL Genet. 7:483-498]. The oligonucleotides 
SEQ ID NOS:8-ll were synthesized and used as described below. For com the 
cts used was based on the the cts of the small subunit of ribulose 1,5-bisphosphate 
carboxylase from com [Lebrun et al. (1987) Nucleic Acids Res. 15:4360] and is 
designated mcts to distinguish it from the soybean cts. The oligonucleotides SEQ 
ID NOS: 17-22 were synthesized and used as described below. 
Fourteen chimeric genes were created: 



No. 


1) 


35S promoter/Cab leader/lygC/Nos 3' 


No. 


2) 


35S promoter/Cab leader/cts/lYsC/Nos 3' 


No. 


3) 


35S promoter/Cab leader/cts/lYsC-M4/Nos 3' 


No. 


4) 


phaseolin 5' region/cts/l^sC/phaseolin 3' region 


No. 


5) 


phaseolin 5' region/cts/rysC-NHfchaseolin 3' region 


No. 


6) 


35S promoter/Cab leader/ecodapA/Nos 3' 


No. 


7) 


35S promoter/Cab leader/cts/ecodap£/Nos 3 
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phaseolin 5' region/ecoda^A/phase^J^p' region 
phaseolin 5 f region/cts/ecodapA/fchaseolin 3' reg: 



No. 

No. 9^ phaseolin 5* region/cts/ecodapA^phaseolin 3' region 
No. 10) 35S promoter/Cab leader/cts/cordaEA/Nos 3 
No. 11) phaseolin 5' region/cts/cordapA/phaseolin 3* region 
5 No. 12) KTI3 5' region/cts/cordapA/KTB 3* region 

No. 13) HH534 5' region/mcts/ly^-M4/HH2-l 3' region 
No. 14) HH534 5' region/mcts/ecodapA/HH2-l 3' region 
A 1440 bp Nco I-Hpa I fragment containing the entire lysC coding region 
plus about 90 bp of 3' non-coding sequence was isolated from an agarose gel 
10 following electrophoresis and inserted into the leaf expression cassette digested 
with Nco I and Sma I (chimeric gene No. 1), yielding plasmid pBT483. 

Oligonucleotides SEQ ID NO:8 and SEQ ID NO:9, which encode the 
carboxy terminal part of the chloroplast targeting signal, were annealed, resulting 
in Nco I compatible ends, purified via polyacrylamide gel electrophoresis, and 
U 1 5 inserted into Nco I digested pBT46 1 . The insertion of the correct sequence in the 

O correct orientation was verified by DNA sequencing yielding pBT496. 

nj Oligonucleotides SEQ ID NO: 10 and SEQ ID NO: 1 1 , which encode the amino 

W terminal part of the chloroplast targeting signal, were annealed, resulting in Nco I 

Sj compatible ends, purified via polyacrylamide gel electrophoresis, and inserted into 

ffi 20 Nco I digested pBT496. The insertion of the correct sequence in the correct 
? , orientation was verified by DNA sequencing yielding pBT521. Thus the cts was 

rU fused to the lysC gene. 

f * To fuse the cts to the lvsC -M4 gene, pBT521 was digested with Sal I, and an 

CI approximately 900 bp DNA fragment that included the cts and the amino terminal 

^ 25 coding region of IvsC was isolated. This fragment was inserted into Sal I digested 
pBT492, effectively replacing the amino terminal coding region of lysC-M4 with 
the fused cts and the amino terminal coding region of IvsC . Since the mutation that 
resulted in lysine-insensitivity was not in the replaced fragment, the new plasmid, 
pBT523, carried the cts fused to lysC -M4. 
30 The 1600 bp Nco I-Hpa I fragment containing the cts fused to lysC plus 

about 90 bp of 3 f non-coding sequence was isolated and inserted into the leaf 
expression cassette digested with Nco I and Sma I (chimeric gene No. 2), yielding 
plasmid pBT541 and the seed-specific expression cassette digested with Nco I and 
Sma I (chimeric gene No. 4), yielding plasmid pBT543. 
35 Similarly, the 1600 bp Nco I-Hpa I fragment containing the cts fused to 

lysC -M4 plus about 90 bp of 3' non-coding sequence was isolated and inserted into 
the leaf expression cassette digested with Nco I and Sma I (chimeric gene No. 3), 
yielding plasmid pBT540 and the seed-specific expression cassette digested with 
Nco I and Sma I (chimeric gene No. 5), yielding plasmid pBT544. 
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Before ir^^on into the expression cassettes, the-^^^£A gene was 
modified to insert a restriction endonuclease site, Kpn I, just after the translation 
stop codon. The oligonucleotides SEQ ID NOS: 12-13 were synthesized for this 
purpose: 



SEQ ID NO: 12: 

CCGGTTTGCT GTAATAGGTA CCA 



SEQ ID NO: 13: 

AGCTTGGTAC CTATTACAGC AAACCGGCAT G 



Oligonucleotides SEQ ID NO: 12 and SEQ ID NO: 13 were annealed, 
resulting in an Sph I compatible end on one end and a Hind DI compatible end on 
the other and inserted into Sph I plus Hind EH digested pBT437. The insertion of 
the correct sequence was verified by DNA sequencing yielding pBT443. 

An 880 bp Nco I-Kpn I fragment from pBT443 containing the entire 
ec odapA coding region was isolated from an agarose gel following electrophoresis 
and inserted into the leaf expression cassette digested with Nco I and Kpn I 
(chimeric gene No. 6), yielding plasmid pBT450 and into the seed-specific 
expression cassette digested with Nco I and Kpn I (chimeric gene No. 8), yielding 
plasmid pBT494. 

Oligonucleotides SEQ ID NO:8 and SEQ ID NO:9, which encode the 
carboxy terminal part of the chloroplast targeting signal, were annealed resulting in 
Nco I compatible ends, purified via polyacrylamide gel electrophoresis, and 
inserted into Nco I digested pBT450. The insertion of the correct sequence in the 
correct orientation was verified by DNA sequencing yielding pBT451. A 950 bp 
Nco I-Kpn I fragment from pBT451 encoding the carboxy terminal part of the 
chloroplast targeting signal fused to the entire eco dapA coding region was isolated 
from an agarose gel following electrophoresis and inserted into the seed-specific 
expression cassette digested with Nco I and Kpn I, yielding plasmid pBT495. 
Oligonucleotides SEQ ID NO: 10: and SEQ ID NO: 11:, which encode the amino 
terminal part of the chloroplast targeting signal, were annealed resulting in Nco I 
compatible ends, purified via polyacrylamide gel electrophoresis, and inserted into 
Nco I digested pBT451 and pBT495. Insertion of the correct sequence in the 
correct orientation was verified by DNA sequencing yielding pBT455 and pBT520, 
respectively. Thus the cts was fused to the eco dapA gene in the leaf expression 
cassette (chimeric gene No. 7) and the seed-specific expression cassette (chimeric 
gene No. 9). 
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An 870 bj^> I-EcoR I fragment from pFS766 cd^^iing the entire 
cor dapA coding region was isolated from an agarose gel following electrophoresis 
and inserted into the leaf expression cassette digested with Nco I and EcoR I, 
yielding plasmid pFS789. To attach the cts to the co rdapA gene, a DNA fragment 
containing the entire cts was prepared using PCR. The template DNA was 
pBT540 and the oligonucleotide primers used were: 



SEQ ID NO: 14: 

GCTTCCTCAA TGATCTCCTC CCCAGCT 



SEQ ID NO: 15: 

CATTGTACTC TTCCACCGTT GCTAGCAA 



PCR was performed using a Perkin-Elmer Cetus kit according to the 
instructions of the vendor on a thermocycler manufactured by the same company. 
The PCR-generated 160 bp fragment was treated with T4 DNA polymerase in the 
presence of the 4 deoxyribonucleotide triphosphates to obtain a blunt-ended 
fragment. The cts fragment was inserted into pFS789 which had been digested 
with Nco I and treated with the Klenow fragment of DNA polymerase to fill in the 
5' overhangs. The inserted fragment and the vector/Insert junctions were 
determined to be correct by DNA sequencing, yielding pFS846 containing chimeric 
gene No. 10.. 

A 1030 bp Nco I-Kpn I fragment from pFS846 containing the cts attached to 
the co rdapA coding region was isolated from an agarose gel following electro- 
phoresis and inserted into the phaseolin seed expression cassette digested with 
Nco I and Kpn I, yielding plasmid pFS889 containing chimeric gene No. 1 1. 
Similarly, the 1030 bp Nco I-Kpn I fragment from pFS846 was inserted into the 
KT13 seed expression cassette digested with Nco I and Kpn I, yielding plasmid 
pFS862 containing chimeric gene No. 12. 

Oligonucleotides SEQ ID NO:94 and SEQ ID NO:95, which encode the 
carboxy terminal part of the com chloroplast targeting signal, were annealed, 
resulting in Xba I and Nco I compatible ends, purified via polyacrylamide gel 
electrophoresis, and inserted into Xba I plus Nco I digested pBT492 (see Example 
2). The insertion of the correct sequence was verified by DNA sequencing yielding 
pBT556. Oligonucleotides SEQ ID NO:96 and SEQ ID NO:97, which encode the 
middle part of the chloroplast targeting signal, were annealed, resulting in Bgl II 
and Xba I compatible ends, purified via polyacrylamide gel electrophoresis, and 
inserted into Bgl II and Xba I digested pBT556. The insertion of the correct 
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sequence was \JA?d by DNA sequencing yielding pBT Oligonucleotides 
SEQ ID NO:98 and SEQ ID NO:99, which encode the amino terminal part of the 
chloroplast targeting signal, were annealed, resulting in Nco I and Afl II compatible 
ends, purified via poly aery lamide gel electrophoresis, and inserted into Nco I and 
Afl II digested pBT557. The insertion of the correct sequence was verified by 
DNA sequencing yielding pBT558. Thus the mcts was fused to the lysC-M4 gene. 

A 1 .6 kb Nco I-Hpa I fragment from pBT558 containing the mcts attached 
to the lysC -M4 gene was isolated from an agarose gel following electrophoresis 
and inserted into the constitutive corn expression cassette digested with Nco I and 
Sma I, yielding plasmid pBT573 containing chimeric gene No. 13. 

To attach the mcts to the eco dapA gene a DNA fragment containing the 
entire mcts was prepared using PCR as described above. The template DNA was 
pBT558 and the oligonucleotide primers used were: 

SEQ ID NO: 100: 
GCGCCCACCG TGATGA 

SEQIDNO:101: 
CACCGGATTC TTCCGC 

The mcts fragment was inserted into pBT450 (above) which had been 
digested with Nco I and treated with the Klenow fragment of DNA polymerase to 
fill in the 5* overhangs. The inserted fragment and the vector/insert junctions were 
determined to be correct by DNA sequencing, yielding pBT576. Plasmid pBT576 
was digested with Asp718, treated with the Klenow fragment of DNA polymerase 
to yield a blunt-ended fragment, and then digested with Nco I. The resulting 1030 
bp Nco I-blunt-ended fragment containing the ecodagA gene attached to the mcts 
was isolated from an agarose gel following electrophoresis. This fragment was 
inserted into the constitutive com expression cassette digested with Bgl II, treated 
with the Klenow fragment of DNA polymerase to yield a blunt-ended fragment, 
and then digested with Nco I, yielding plasmid pBT583 containing chimeric gene 
No. 14. 

EXAMPLE 7 

Transformation of Tobacco with the 35S Promoter/lvsC Chimeri c Genes 
Transformation of tobacco with the 35S promoter/lysC chimeric genes was 

effected according to the following: 

The 35S promoter/Cab leader/ly£/Nos 3\ 35S promoter/Cab 

leader/cts/Ix^/Nos 3\ and 35S promoter/Cab leader/cts/lys£-M4/Nos 3* chimeric 
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id inserted into 
i^ids pBT497, 



pBT545 and pBT542, respectively. The vector is part of a binary Ttplasmid 
vector system [Bevan, (1984) Nucl. Acids. Res. 72:8711-8720] of Agrobacterium 
5 tumefaciens. The vector contains: (1) the chimeric gene nopaline synthase 

promoter/neomycin phosphotransferase coding region (nos:NPT II) as a selectable 
marker for transformed plant cells [Bevan et al. (1983) Nature 304: 184-186]; 
(2) the left and right borders of the T-DNA of the Ti plasmid [Bevan (1984) Nucl. 
Acids. Res. 72:871 1-8720]; (3) the E. coli lacZ a-complementing segment [Vieria 
10 and Messing (1982) Gene 79:259-267] with unique restriction endonuclease sites 
for EcoR I, Kpn I, BamH I and Sal I; (4) the bacterial replication origin from the 
Pseudomonas plasmid pVSl [Itoh et al. (1984) Plasmid 77:206-220]; and (5) the 
bacterial neomycin phosphotransferase gene from Tn5 [Berg et al. (1975) Proc. 
Natl. Acad Sci. USA. 72:3628-3632] as a selectable marker for transformed A. 
15 tumefaciens. 

The 35S promoter/Cab leader/cts/ly^/Nos 3\ and 35S promoter/Cab 
leader/cts/lysC-M4/Nos 3' chimeric genes were also inserted into the binary vector 
pBT456, yielding pBT547 and pBT546, respectively. This vector is pZS97K, into 
which the chimeric gene 35S promoter/Cab leader/cts/dapA/Nos 3* had previously 
20 been inserted as a BamH I-Sal I fragment (see Example 9). In the cloning process 
large deletions of the dapA chimeric gene occurred. As a consequence these 
plasmids are equivalent to pBT545 and pBT542, in that the only transgene 
expressed in plants (other than the selectable marker gene, NPT II) was 35S 
promoter/Cab leader/ctsyflysC/Nos 3' or 35S promoter/Cab leader/cts/lysC-M4/Nos 



The binary vectors containing the chimeric lysC genes were transferred by 
tri-parental matings [Ruvkin et al. (1981) Nature 289:85-88] to Agrobacterium 
strain LBA4404/pAL4404 [Hockema et al (1983), Nature 505:179-180]. The 
Agrobacterium transformants were used to inoculate tobacco leaf disks [Horsch et 
al. (1985) Science 227:1229-1231]. Transgenic plants were regenerated in 
selective medium containing kanamycin. 

To assay for expression of the chimeric genes in leaves of the transformed 
plants, protein was extracted as follows. Approximately 2.5 g of young plant 
leaves, with the midrib removed, were placed in a dounce homogenizer with 0.2 g 
of polyvinyl polypyrrolidone and 1 1 mL of 50mM Tris-HCl pH8.0, 50mM NaCl, 
ImM EDTA (TNE) and ground thoroughly. The suspension was further 
homogenized by a 20 sec treatment with a Brinkman Polytron Homogenizer 
operated at setting 7. The resultant suspensions were centrifuged at 16,000 ipm 
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for 20 m in at 4° a Dupont-Sorvall superspeed centrifi^Bsing an SS34 rotor 
to remove particulates. The supernatant was decanted, the volume was adjusted to 
be 10 mL by addition of TNE if necessary, and 8 mL of cold, saturated ammonium 
sulfate was added. The mixture was set on ice for 30 min and centrifuged as 
described above. The supernatant was decanted and the pellet, which contained 
the AKffl protein, was resuspended in 1 mL of TNE and desalted by passage over 
a Sephadex G-25 M column (Column PD-10, Pharmacia). 

For immunological characterization, three volumes of extract were mixed 
with 1 volume of 4 X SDS-gel sample buffer (0.17M Tris-HCl pH6.8, 6.7% SDS, 
16.7% f}-mercaptoethanol, 33% glycerol) and 3 jxL from each extract were run per 
lane on an SDS polyacrylamide gel, with bacterially produced AKJH serving as a 
size standard and protein extracted from untransformed tobacco leaves serving as a 
negative control. The proteins were then electrophoretically blotted onto a 
nitrocellulose membrane (Western Blot). The membranes were exposed to the 
AKTTT antibodies prepared as described in Example 2 at a 1:5000 dilution of the 
rabbit serum using standard protocol provided by BioRad with their Immun-Blot 
Kit. Following rinsing to remove unbound primary antibody, the membranes were 
exposed to the secondary antibody, donkey anti-rabbit Ig conjugated to 
horseradish peroxidase (Amersham) at a 1:3000 dilution. Following rinsing to 
remove unbound secondary antibody, the membranes were exposed to Amersham 
chemiluminescence reagent and X-ray film. 

Seven of thirteen transformants containing the chimeric gene, 35S 
promoter/Cab leader/cts/ly^-M4/Nos 3', and thirteen of seventeen transformants 
containing the chimeric gene, 35S promoter/Cab leader/cts/]y§C/Nos 3', produced 
AKTTT protein (Table 2). In all cases protein which reacted with the AKHI 
antibody was of several sizes. Approximately equal quantities of proteins equal in 
size to AKTTT produced in E. coli, and a protein about 6 kd larger were evident in 
all samples, suggesting that the chloroplast targeting signal had been removed from 
about half of the protein synthesized. This further suggests that about half of the 
the protein entered the chloroplast. In addition, a considerable amount" of protein 
of higher molecular weight was observed. The origin of this protein is unclear; the 
total amount present was equal or slightly greater than the amounts of the mature 
and putative AK1H precursor proteins combined. 

The leaf extracts were assayed for AK activity as described in Example 2. 
AKHI could be distinguished from endogenous AK activity, if it were present, by 
its increased resistance to lysine plus threonine. Unfortunately, however, this assay 
was not sensitive enough to reliably detect AKIII activity in these extracts. Zero of 
four transformants containing the chimeric gene, 35S promoter/Cab 
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leader /IysC/ NoslJghowed AKHI activity. Only one extrl^^rom a transformant 
containing the 35S promoter/Cab leader/cts /IvsC -M4/Nos 3' gene, produced a 
convincing level of enzyme activity. This came from transformant 546-49A, and 
was also the extract that showed the highest level of AKIII-M4 protein via 
Western blot. 

An alternative method to detect the expression of active AKHI enzyme was 
to evaluate the sensitivity or resistance of leaf tissue to high concentrations of 
lysine plus threonine. Growth of cell cultures and seedlings of many plants is 
inhibited by high concentrations of lysine plus threonine; this is reversed by 
addition of methionine (or homoserine which is converted to methionine in vivo). 
Lysine plus threonine inhibition is thought to result from feedback inhibition of 
endogenous AK, which reduces flux through the pathway leading to starvation for 
methionine. In tobacco there are two AK enzymes in leaves, one lysine-sensitive 
and one threonine sensitive [Negrutui et al. (1984) Theor. Appl. Genet. 68:11-20]. 
High concentrations of lysine plus threonine inhibit growth of shoots from tobacco 
leaf disks and inhibition is reversed by addition of low concentrations of 
methionine. Thus, growth inhibition is presumably due to inhibition of the two AK 
isozymes. 

Expression of active lysine and threonine insensitive AKUI-M4 would be 
predicted to reverse the growth inhibition. As can be seen in Table 2, this was 
observed. There is, in fact, a good correlation between the level of AKQI-M4 
protein expressed and the resistance to lysine plus threonine inhibition. Expression 
of lysine-sensitive wild type AKHI does not have a similar effect. Only the highest 
expressing transformant showed any resistance to lysine plus threonine inhibition, 
and this was much less dramatic than that observed with AKIQ-M4. 

To measure free amino acid composition of the leaves, free amino acids were 
extracted as follows. Approximately 30-40 mg of young leaf tissue was chopped 
with a razor and dropped into 0.6 mL of methanol/ chloroform/water mixed in 
ratio of 12v/5v/3v (MCW) on dry ice. After 10-30 min the suspensions were 
brought to room temperature and homogenized with an Omni 1000 Handheld 
Rechargeable Homogenizer and then centrifuged in an eppendorf microcentrifuge 
for 3 min. Approximately 0.6mL of supernatant was decanted and an additional 
0.2mL of MCW was added to the pellet which was then vortexed and centrifuged 
as above. The second supernatant, about 0.2mL, was added to the first. To this, 
0.2mL of chloroform was added followed by 0.3mL of water. The mixture was 
vortexed and the centrifuged in an eppendorf microcentrifuge for about 3 min, the 
upper aqueous phase, approximately 1 .OmL, was removed, and was dried down in 
a Savant Speed Vac Concentrator. One-tenth of the sample was run on a Beckman 
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Model 6300 am cid analyzer using post-column runhyf^ detection. Relative 
free amino acid levels in the leaves were compared as ratios of lysine or threonine 
to leucine, thus using leucine as an internal standard. There was no consistent 
effect of expression of AKHI or AKTII-M4 on the lysine or threonine (or any other 
amino acid) levels in the leaves (Table 2). 
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TABLE 2 

BT542 transform ants: 35S promoter/Cab leader/cts /lvsC -M4/Nos 3' 
BT545 transform ants: 35S promoter/Cab leader/cts /lvsC/ Nos 3' 
BT546 transform ants: 35S promoter/Cab leader/cts /IysC -M4/Nos 3' 
BT547 transformants: 35S promoter/Cab leader/cts/bsG/Nos 3' 



LINE 



542-5B 

542-26A 

542-27B 

542-35A 

542-54A 

542-57B 

545-5A 

545-7B 

545-17B 

545-27A 

545-50E 

545-52A 



FREE AMINO 
ACIDS/LEAF 
K/L T/L 



0.5 3.5 

0.5 3.3 

0.5 3.4 

0.5 4.3 

0.5 2.8 

0.5 3.4 

n.cL n.cL 

0.5 3.4 

0.6 2.5 

0.6 3.5 

0.6 3.6 

0.5 3.6 



AKm 

ACTIVITY 
U/MG/HR 



0 
0 
0 

0.01 

0 

0 

0.02 
0 

0.01 
0 

0.03 
0.02 



WESTERN 
BLOT 



RESISTANCE 
TO Lys 3mM 
+ Thr 3mM 



+ 
+ 



546-4A 

546-24B 

546-44A 

546-49A 

546-54A 

546-56B 

546-58B 



0.4 4.5 

0.6 4.9 

0.5 6.0 

0.7 7.0 

0.5 6.4 

0.5 4.4 

0.6 8.0 



0 

0.04 
0.03 
0.10 
0 

0.01 
0 



547-3D 
547-8B 
547-9A 



0.4 5.4 
0.6 5.0 
0.5 4.3 



0 

0.02 
0.03 



547- 12A 
547-15B 



0.7 3.9 
0.6 4.5 



0 
0 
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547- I6A 
547- 18 A 
547-22A 
547-25C 
547-28C 
547-29C 



.5 3.6 

0.5 4.0 

0.8 4.4 

0.5 4.3 

0.6 5.6 

0.5 3.8 



• 



EXAMPLE 8 

Transformation of Tobacco with the Phaseolin Promoter/lvsC Chimeric Genes 
The phaseolin promoter /lvsC chimeric gene cassettes, phaseolin 5' 
region/cts/lysQ^haseolin 3' region, and phaseolin 5' region/cts /lysC -M4/phaseolin 
y region (Example 6) were isolated as approximately 3.3 kb Hind HI fragments. 
These fragments were inserted into the unique Hind DI site of the binary vector 
pZS97 (Figure 6) yielding pBT548 and pBT549, respectively. This vector is 
similar to pZS97K described in Example 7 except for the presence of two 
additional unique cloning sites, Sma I and Hind EH, and the bacterial 0-lactamase 
gene (causing ampicillin resistance) as a selectable marker for transformed 
A. tumefaciens instead of the bacterial neomycin phosphotransferase gene. 

The binary vectors containing the chimeric lvsC genes were transferred by 
tri-parental matings to Agrobacterium strain LBA4404/pAL4404, the 
Agrobacterium transformants were used to inoculate tobacco leaf disks and 
transgenic plants regenerated by the methods set out in Example 7. 

To assay for expression of the chimeric genes in the seeds of the transformed 
plants, the plants were allowed to flower, self-pollinate and go to seed Total 
proteins were extracted from mature seeds as follows. Approximately 30-40 mg of 
seeds were put into a 1 .5mL disposable plastic microfuge tube and ground in 
0.25mLof 50mMTris-HClpH6.8, 2mM EDTA, 1% SDS, 1% 0-mercaptoethanol. 
The grinding was done using a motorized grinder with disposable plastic shafts 
designed to fit into the microfuge tube. The resultant suspensions were centrifuged 
for 5 min at room temperature in a microfuge to remove particulates. Three 
volumes of extract was mixed with 1 volume of 4 X SDS-gel sample buffer (0.17M 
Tris-HCl pH6.8, 6.7% SDS, 16.7% 0-mercaptoethanol, 33% glycerol) and 5^lL 
from each extract were run per lane on an SDS polyacrylamide gel, with bacterially 
produced AKJH serving as a size standard and protein extracted from 
untransformed tobacco seeds serving as a negative control. The proteins were then 
electrophoretically blotted onto a nitrocellulose membrane. The membranes were 
exposed to the AKHI antibodies (prepared as described in Example 2) at a 1:5000 
dilution of the rabbit serum using standard protocol provided by BioRad with their 
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Immun-Blot Kiti^^llowing rinsing to remove unbound pl^^ry antibody the 
membranes were exposed to the secondary antibody, donkey anti-rabbit Ig 
conjugated to horseradish peroxidase (Amersham) at a 1:3000 dilution. Following 
rinsing to remove unbound secondary antibody, the membranes were exposed to 
Amersham chemilum inescence reagent and X-ray film. 

Ten of eleven transformants containing the chimeric gene, phaseolin 5' 
region/cts/lysC/phaseolin 3' region, and ten of eleven transformants containing the 
chimeric gene, phaseolin 5' region/cts /lysC -M4/phaseolin 3' region, produced 
AKin protein (Table 3). In all cases protein which reacted with the AKHI 
antibody was of several sizes. Approximately equal quantities of proteins equal in 
size to AKQI produced in E. coh, and about 6 kd larger were evident in all 
samples, suggesting that the chloroplast targeting signal had been removed from 
about half of the protein synthesized. This further suggests that about half of the 
the protein entered the chloroplast. In addition, some proteins of lower molecular 
weight were observed, probably representing breakdown products of the AKHI 
polypeptide. 

To measure free amino acid composition of the seeds, free amino acids were 
extracted from mature seeds as follows. Approximately 30-40 mg of seeds and an 
approximately equal amount of sterilized sand were put into a 1 .5mL disposable 
plastic microfuge tube along with 0.2mL of methanol/chloroform/water mixed in 
ratio of 12v/5v/3v (MCW) at room temperature. The seeds were ground using a 
motorized grinder with disposable plastic shafts designed to fit into the microfuge 
tube. After grinding an additional 0.5mL of MCW was added, the mixture was 
vortexed and then centrifuged in an eppendorf microcentrifuge for about 3 min. 
Approximately 0.6mL of supernatant was decanted and an additional 0.2mL of 
MCW was added to the pellet which was then vortexed and centrifuged as above. 
The second supernatant, about 0.2mL, was added to the first. To this, 0.2mL of 
chloroform was added followed by 0.3mL of water. The mixture was vortexed 
and then centrifuged in an eppendorf microcentrifuge for about 3 min, the upper 
aqueous phase, approximately l.OmL, was removed, and was dried down in a 
Savant Speed Vac Concentrator. The samples were hydrolyzed in 6N hydrochloric 
acid, 0.4% p-mercaptoethanol under nitrogen for 24 h at 1 10-120°C; 1/4 of the 
sample was ran on a Beckman Model 6300 amino acid analyzer using post-column 
ninhydrin detection. Relative free amino acid levels in the seeds were compared as 
ratios of lysine, methionine, threonine or isoleucine to leucine, thus using leucine as 
an internal standard. 

To measure the total amino acid composition of the seeds, 6 seeds were 
hydrolyzed in 6N hydrochloric acid, 0.4% P-mercaptoethanol under nitrogen for 
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e was run on a BeclJ 




Model 6300 amino 



acid analyzer using post-column ninhydrin detection. Relative amino acid levels in 
the seeds were compared as ratios of lysine, methionine, threonine or isoleucine to 
leucine, thus using leucine as an internal standard. Because the transgene was 
5 segregating in these self-pollinated progeny of the primary transformant and only 
six seeds were analyzed, there was expected to be some sampling error. Therefore, 
the measurement was repeated multiple times for some of the lines (Table 3). 

Expression of the cts /lysC gene in the seeds resulted in a 2 to 4-fold increase 
in the level of free threonine in the seeds and a 2 to 3-fold increase in the level of 
10 free lysine in some cases. There was a good correlation between transform ants 
expressing higher levels of AKUI protein and those having higher levels of free 
threonine, but this was not the case for lysine. These relatively small increases of 
free threonine or lysine were not sufficient to yield detectable increases in the levels 
of total threonine or lysine in the seeds. Expression of the cts/IysC-M4 gene in the 
15 seeds resulted in a 4 to 23-fold increase in the level of free threonine in the seeds 
and a 2 to 3 -fold increase in the level of free lysine in some cases. There was a 
good correlation between transformants expressing higher levels of AKUI protein 
and those having higher levels of free threonine, but this was again not the case for 
lysine. The larger increases of free threonine were sufficient to yield detectable 
20 increases in the levels of total threonine in the seeds. Sixteen to twenty-five 

percent increases in total threonine content of the seeds were observed in three 
lines which were sampled multiple times. (Isoleucine to leucine ratios are shown 
for comparison.) The lines that showed increased total threonine were the same 
ones the showed the highest levels of increase in free threonine and high expression 
25 of the AKHI-M4 protein. From these results it can be estimated that free threonine 
represents about 1% of the total threonine present in a normal tobacco seed, but 
about 18% of the total threonine present in seeds expressing high levels of 
AKHI-M4. 

TABLE 3 

BT548 Transformants: phaseolin 5' region/cts/lygC/phaseolin 3' 
BT549 Transformants: phaseolin 5* region/cts/bsC-M4/phaseolin 3* 



LINE 



SEED 
FREE AMINO ACID 
K/L T/L I/L 



SEED 
TOTAL AMINO ACID 
K/L T/L l/L 



WESTERN 



NORMAL 



0.49 1.34 0.68 



0.35 0.68 0.63 



548-2A 



1.15 2.3 0.78 



0.43 0.71 0.67 



548-4D 



0.69 5.3 0.80 



0.35 0.69 0.65 



548-6A 



0.39 3.5 0.85 



0.35 0.69 0.64 



+ 
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548-7A 


^032 4.2 0.83 


0.36 0.68 O.tiJ 




548- 14A 


0.41 3.1 0.82 


0.32 0.67 0.65 


+ 


548-18A 


0.51 1.5 0.69 


0.37 0.67 0.63 




548-22A 


1.41 2.9 0.75 


0.47 0.74 0.65 


+++ 


548-24A 


0.73 3.7 0.81 


0.38 0.68 0.65 


++ 


548-41 A 


0.40 2.8 0.77 


0.37 0.68 0.65 


+ 


548-50A 


0.46 4.0 0.81 


0.33 0.68 0.65 


+ 


548-57A 


0.50 3.8 0.80 


0.33 0.67 0.65 


++ 


549-5A 


0.63 5.9 0.69 


0.32 0.65 0.65 


+ 


549-7A 


0.51 8.3 0.78 


0.33 0.67 0.63 


++ 


549-20A 


0.67 30 0.88 


0.38* 0.82* 0.65* 


MM 


549-34A 


0.43 1.3 0.69 


0.32 0.64 0.63 




549-39D 


0.83 16 0.83 


0.35 0.71 0.63 


+++ 


549-40A 


0.80 4.9 0.74 


0.33 0.63 0.64 


+ 


549-41C 


0.99 13 0.80 


0.38*0.79*0.65* 


+++ 


549-46A 


0.48 7.7 0.84 


0.34 0.70 0.64 


+ 


549-52A 


0.81 9.2 0.80 


0.39 0.70 0.65 


++ 


549-57A 


0.60 15 0.77 


0.35* 0.85* 0.64* 


+++ 


549-60D 0.85 11 0.79 0.37 0.73 0.65 
Normal was calculated as the average of 6 samples for free amino acid 
amino acids. 


++ 

and 23 samples for total 



* Indicates average of at least 5 samples 



Seeds derived from self-pollination of two plants transformed with the 
phaseolin 5' region/cts^Oj^C-M^haseolin 3' region, plants 549-5A and 549-40A, 
showed 3 kanamycin resistant to 1 kanamycin sensitive seedlings, indicative of a 
single site of insertion of the transgene. Progeny plants were grown, self-pollinated 
and seed was analyzed for segregation of the kanamycin marker gene. Progeny 
plants that were homozygous for the transgene insert, thus containing two copies 
of the gene cassette, accumulated approximately 2 times as much threonine in their 
seed as their sibling heterozygous progeny with one copy of the gene cassette and 
about 8 times as much as seed without the gene. This demonstrates that the level 
of expression of the E. coli enzyme controls the accumulation of free threonine. 

EXAMPLE 9 

Transformation of Tobacco with the 35S Promoter/ecodapA Chimeric Genes 
The 35S promoter/Cab leader/eco dapA/ N os 3 f and 35S promoter/Cab 
leader/cts/eco^agA/Nos 3', chimeric genes were isolated as 3.1, and 3.3 kb 
BamH I-Sal I fragments, respectively and inserted into BamH I-Sal I digested 
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binary vector pZ^^ (Figure 5), yielding plasmids pBT4<^^d pBT463, 
respectively. The binary vector is described in Example 7. 

The binary vectors containing the chimeric eco dapA genes were transferred 
by tri-parental matings to Agrobacterium strain LB A4404/pAL4404, the 
Agrobacterium transfonnants used to inoculate tobacco leaf disks and the resulting 
transgenic plants regenerated by the methods set out in Example 7. 

To assay for expression of the chimeric genes in leaves of the transformed 
plants, protein was extracted as described in Example 7, with the following 
modifications. The supernatant from the first ammonium sulfate precipitation, 
approximately 18mL, was mixed with an additional 12mL of cold, saturated 
ammonium sulfate. The mixture was set on ice for 30 min and centrifuged as 
described in Example 7. The supernatant was decanted and the pellet, which 
contained the DHDPS protein, was resuspended in 1 mL of TNE and desalted by 
passage over a Sephadex G-25 M column (Column PD-10, Pharmacia). 

The leaf extracts were assayed for DHDPS activity as described in Example 
4. E. coli DHDPS could be distinguished from tobacco DHDPS activity by its 
increased resistance to lysine; E. cqIi DHDPS retained 80-90% of its activity at 
O.lmM lysine, while tobacco DHDPS was completely inhibited at that 
concentration of lysine. One of ten transfonnants containing the chimeric gene, 
35S promoter/Cab leader/eco dapA/ Nos 3 f , showed E. coli DHDPS expression, 
while five of ten transfonnants containing the chimeric gene, 35S promoter/Cab 
leader/cts/ecodagA/Nos 3' showed E, coli DHDPS expression. 

Free amino acids were extracted from leaves as described in Example 7. 
Expression of the chimeric gene, 35S promoter/Cab leader/cts/ecodapA/Nos 3\ but 
not 35S promoter/Cab leader/eco dapA/ Nos 3* resulted in substantial increases in 
the level of free lysine in the leaves. Free lysine levels from two to 90-fold higher 
than un transformed tobacco were observed. 

The transformed plants were allowed to flower, self-pollinate and go to seed. 
Seeds from several lines transformed with the 35S promoter/Cab leader/ 
cts/ec odapA/ Nos 3' gene were surface sterilized and germinated on agar plates in 
the presence of kanamycin. Lines that showed 3 kanamycin resistant to 1 
kanamycin sensitive seedlings, indicative of a single site of insertion of the 
transgenes, were identified. Progeny that were homozygous for the transgene 
insert were obtained from these lines using standard genetic analysis. The 
homozygous progeny were then characterized for expression of E. coli DHDPS in 
young and mature leaves and for the levels of free amino acids accumulated in 
young and mature leaves and in mature seeds. 
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Expression of active E. coli DHDPS enzyme was clearly evident in both 
young and mature leaves of the homozygous progeny of the transformants (Table 
4). High levels of free lysine, 50 to 100-fold higher than normal tobacco plants, 
accumulated in the young leaves of the plants, but a much smaller accumulation of 
free lysine (2 to 8-fold) was seen in the larger leaves. Experiments that measure 
lysine in the phloem suggest that lysine is exported from the large leaves. This 
exported lysine may contribute to the accumulation of lysine in the small growing 
leaves, which are known to take up, rather than export nutrients. Since the larger 
leaves make up the major portion of the biomass of the plant, the total increased 
accumulation of lysine in the plant is more influenced by the level of lysine in the 
larger leaves. No effect on the free lysine levels in the seeds of these plants was 
observed (Table 4). 



TABLE 4 

Progeny of BT463 transformants homozygous for 
35S promoter/Cab leader/cts/ecodapA/Nos 3' 



LINE 


LEAF 
SIZE 


LEAF 
FREE AMINO ACID 
K/L K/TOT 


1 COJJ 
DHDPS 
OD/607mg 


SEED FREE 
AMINO ACID 
K/L 


NORMAL 


3 in. 


0.5 


0.006 


0 


0.5 


463-18C-2 


3 in. 


47 


0.41 


7.6 


0.4 


463-18C-2 


12 in. 


1 


0.02 


5.5 




463-25A-4 


3 in. 


58 


0.42 


6.6 


0.4 


463-25A-4 


12 in. 


4 


0.02 


12.2 




463-38C-3 


3 in. 


28 


0.28 


6.1 


0.5 


463-38C-3 


12 in. 


2 


0.04 


8.3 





EXAMPLE 1Q 

Transformation of Tobacco with the Phaseolin Promoter/ecodapA Chimeric Genes 

The chimeric gene cassettes, phaseolin 5' region/ecodagA^phaseolin 3* 
region, and phaseolin 5 1 region/cts/eco dapA/ phaseolin 3' region (Example 6) were 
isolated as approximately 2.6 and 2.8 kb Hind m fragments, respectively. These 
fragments were inserted into the unique Hind ELI site of the binary vector pZS97 
(Figure 6), yielding pBT506 and pBT534, respectively. This vector is described in 
Example 8. 

The binary vectors containing the chimeric ec odapA genes were transferred 
by tri-parental matings to Agrobacterium strain LB A4404/p AL4404 , the 
Agrobacterium transformants used to inoculate tobacco leaf disks and the resulting 
transgenic plants were rcgeneratedby the methods set out in Example 7. 
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To assay for expression of the chimeric genes, the transformed plants were 
allowed to flower, self-pollinate and go to seed. Total seed proteins were 
extracted as described in Example 8 and immunologically analyzed as described in 
Example 7, with the following modification. The Western blot membranes were 
exposed to the DHDPS antibodies prepared in Example 4 at a 1:5000 dilution of 
the rabbit serum using standard protocol provided by BioRad with their Immun- 
Blot Kit. 

Thirteen of fourteen transformants containing the chimeric gene, phaseolin 5* 
region/ecodagA/phaseolin 3' region and nine of thirteen transformants containing 
the chimeric gene, phaseolin 5 f region/cts/eco dapA/ phaseolin 3' region, produced 
DHDPS protein detectable via Western blotting (Table 3). Protein which reacted 
with the DHDPS antibody was of several sizes. Most of the protein was equal in 
size to DHDPS produced in E. coli f whether or not the chimeric gene included the 
chloroplast transit sequence. This indicated that the chloroplast targeting signal 
had been efficiently removed from the precursor protein synthesized. This further 
suggests the majority of the protein entered the chloroplast. In addition, some 
proteins of lower molecular weight were observed, probably representing 
breakdown products of the DHDPS polypeptide. 

To measure free amino acid composition and total amino acid composition of 
the seeds, free amino acids and total amino acids were extracted from mature seeds 
and analyzed as described in Example 8. Expression of either the ec odapA gene or 
cts/eco dapA had no effect on the total lysine or threonine composition of the seeds 
in any of the transformed lines (Table 5). Several of the lines that were 
transformed with the phaseolin 5' region/cts/ecodapA/phaseolin 3' chimeric gene 
were also tested for any effect on the free amino acid composition. Again, not 
even a modest effect on the lysine or threonine composition of the seeds was 
observed in lines expressing high levels of E. coli DHDPS protein (Table 5). This 
was a surprising result, given the dramatic effect (described in Example 9) that 
expression of this protein has on the free lysine levels in leaves. 

One possible explanation for this was that the DHDPS protein observed via 
Western blot was not functional. To test this hypothesis, total protein extracts 
were prepared from mature seeds and assayed for DHDPS activity. Approximately 
30-40 mg of seeds were put into a 1.5 mL disposable plastic microfuge tube and 
ground in 0.25mL of 50mM Tris-HCl, 50mM NaCl, ImM EDTA (TNE). The 
grinding was done using a motorized grinder with disposable plastic shafts 
designed to fit into the microfuge tube. The resultant suspensions were centrifuged 
for 5 min at room temperature in a microfuge to remove particulates. 
Approximately 0.1 mL of aqueous supernatant was removed between the pelleted 
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material and the upper oil phase. The seed extracts were assayed for DHDPS 
activity as described in Example 4. E. coli DHDPS could be distinguished from 
tobacco DHDPS activity by its increased resistance to lysine; E. coli DHDPS 
retained about 50% of its activity at 0.4mM lysine, while tobacco DHDPS was 
completely inhibited at that concentration of lysine. High levels of E. coli DHDPS 
activity were seen in all four seed extracts tested eliminating this explanation. 

The presence of the cts sequence in the chimeric ec odapA gene was essential 
for eliciting accumulation of high levels of lysine in leaves. Thus another possible 
explanation was that the cts sequence had somehow been lost during the insertion 
of the chimeric phaseolin 5' region/cts/ecodapA/phaseolin 3* gene into the binary 
vector. PCR analysis of several of the transformed lines demonstrated the presence 
of the cts sequence, however, ruling out this possibility. 

A third explanation was that amino acids are not normally synthesized in 
seeds, and therefore the other enzymes in the pathway were not present in the 
seeds. The results of experiments presented in Example 8, wherein expression of 
phaseolin 5' region/cts/ lysC -M4/fehaseolin 3* gene resulted in accumulation of high 
levels of free threonine in seeds, indicate that this is not the case. 

Taken together these results and the results presented in Example 9, 
demonstrate that expression of a lysine-insensitive DHDPS in either seeds or leaves 
is not sufficient to achieve accumulation of increased free lysine in seeds. 

TAPlgg 

BT506 Transform ants: phaseolin 5* regjon/eco dapA/ pbaseolia 3' 
BT534 Transform ants: phaseolin 5* region/cts/eco dapA/ phaseolin 3' 

SEED: FREE SEED: TOTAL E. COLI 

AMINO ACIDS AMINO ACIDS DHDPS 



LINE 


K/L 


T/L 


K/L 


T/L 


0D/607MG 


WESTERN 


NORMAL 


0.49 


1.34 


0.35 


0.68 






506-2B 






0.34 


0.66 




+ 


506-4B 






0.33 


0.67 




+ 


506- 16A 






0.34 


0.67 




+ 


506- 17 A 






0.36 


0.55 


7.7 


+++ 


506- 19A 






0.37 


0.45 




++ 


506-22A 






0.34 


0.67 




++ 


506-23B 






0.35 


0.67 




++ 


506-33B 






0.34 


0.67 




++ 


506-38B 






0.36 


0.69 


8.7 


+++ 


506-39A 






0.37 


0.70 




++ 


506-40A 






0.36 


0.68 







68 



506-47A 






0.32 


0.68 




+++ 


506-48A 






0.33 


0.69 




i i i 


506-49A 






0.33 


0.69 




+++ 


5 34-8 A 






0.34 


0.66 




- 


534-9A 






0.36 


0.67 




++ 


534-22B 


0.43 


1.32 


0.39 


0.51 


4.9 


+++ 


534-31A 






0.34 


0.66 




- 


534-38A 


0.35 


1.49 


0.42 


0.33 




+++ 


534-39A 






0.38 


0.69 




+ 


534-7A 






0.34 


0.67 




+++ 


534-25B 






0.35 


0.67 




+++ 


534-34B 


0.80 


1.13 


0.42 


0.70 




- 


534-35A 


0.43 


1.18 


0.33 


0.67 




+++ 


534-37B 


0.42 


1.58 


0.37 


0.68 




- 


534-43A 






0.35 


0.68 




+++ 


534-48A 


0.46 


1.24 


0.35 


0.68 


6.2 


+++ 



EXAMPLE 11 

Transformation of Tobacco with the 35S Promoter/cts/dapA 
plus 35S Promoter/cts/lysC-M4 Chimeric Genes 
The 35S promoter/Cab leader/cts/eco dapA/ Nos 3\ and 35S promoter/Cab 
leader/cts/ly^-M4/Nos 3' chimeric genes were combined in the binary vector 
pZS97K (Figure 5). The binary vector is described in Example 7. An 
oligonucleotide adaptor was synthesized to convert the BamH I site at the 5 1 end of 
the 35S promoter/Cab leader/cts /lvsC -M4/Nos 3' chimeric gene (see Figure 4) to 
an EcoR I site. The 35S promoter/Cab leader/cts /lysC -M4/Nos 3* chimeric gene 
was then isolated as a 3.6 kb EcoR I fragment from plasmid pBT540 (Example 6) 
and inserted into pBT463 (Example 9) digested with EcoR I, yielding plasmid 
pBT564. This vector has both the 35S promoter/Cab leader/cts/eco dapA/ Nos 3\ 
and 35S promoter/Cab leader/cts /lysC -M4/Nos 3' chimeric genes inserted in the 
same orientation. 

The binary vector containing the chimeric eco dapA and lysC -M4 genes was 
transferred by tri-parental matings to Agrobacterium strain LB A4404/p AL4404 , 
the Agrobacterium transformants used to inoculate tobacco leaf disks and the 
resulting transgenic plants regenerated by the methods set out in Example 7. 

To assay for expression of the chimeric genes in leaves of the transformed 
plants, protein was extracted as described in Example 7 for AKHI, and as 
described in Example 9 for DHDPS. The leaf extracts were assayed for DHDPS 
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activity as described in Examples 4 and 9. E. coli DHDPS could be distinguished 
from tobacco DHDPS activity by its increased resistance to lysine; E. coli DHDPS 
retained 80-90% of its activity at 0.1 mM lysine, while tobacco DHDPS was 
completely inhibited at that concentration of lysine. Extracts were characterized 
immunologically for expression of AKHI and DHDPS proteins via Western blots as 
described in Examples 7 and 10. 

Ten of twelve transformants expressed £. coli DHDPS enzyme activity 
(Table 6). There was a good correlation between the level of enzyme activity and 
the amount of DHDPS protein detected immunologically. As described in 
Example 7, the AK assay was not sensitive enough to detect enzyme activity in 
these extracts. However, AKIH-M4 protein was detected immunologically in eight 
of the twelve extracts. In some transformants, 564-21 A and 47 A, there was a 
large disparity between the level of expression of DHDPS and AKHI-M4, but in 10 
of 12 lines there was a good correlation. 

Free amino acids were extracted from leaves and analyzed for amino acid 
composition as described in Example 7. In the absence of significant AKIH-M4, 
the level of expression of the chimeric gene, 35S promoter/Cab 
leader/cts/ecodagA/Nos 3* determined the level of lysine accumulation (Table 6). 
Compare lines 564-21 A, 47 A and 39C, none of which expresses significant 
AKIH-M4. Line 564-21 A accumulates about 10-fold higher levels of lysine than 
line 564-47 A which expresses a lower level of E. coli DHDPS and 40-fold higher 
levels of lysine than 564-39C which expresses no E. coli DHDPS. However, in 
transformants that all expressed similar amounts of E. coli DHDPS (564-18A, 
56A, 36E, 55B, 47 A), the level of expression of the chimeric gene, 35S 
promoter/Cab leader/cts /lysC -M4/Nos 3\ controlled the level of lysine 
accumulation. Thus it is clear that although expression of 35S promoter/Cab 
leader/cts /lvsC -M4/Nos 3* has no effect on the free amino acid levels of leaves 
when expressed alone (see Example 7), it can increase lysine accumulation when 
expressed in concert with the 35S promoter/Cab leader/cts/ecodagA/Nos y 
chimeric gene. Expression of these genes together did not effect the level of any 
other free amino acid in the leaves. 
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TABLE 6 

BT564 Transfonnants: 35S promoter/Cab Ieader/cts/ecodapA/Nos 3' 
35S promoter/Cab Ieader/cts/lusC-M4/Nos 3' 

E. COLI 

FREE AA LEAF DHDPS WESTERN WESTERN 



LINE 


nmol/4mg 


FREE AA LEAF 


TT/Mfi/fTR 


DHDPS 


AK-m 




TOT 


K 


K/L 


K/TOT 








564-21A 


117 


57 


52 


0.49 


Z.H 


ITT 


4-/- 


564-18A 


99 


56 


69 


0.57 


1.1 


i t 

"T-r 


■ i, 


564-56A 


104 


58 


58 


0.56 


1.3 


■ i 

TT 


TT 


564-36E 


85 


17 


17 


0.20 


1.5 


• i_ 


it* 
1 1 1 


564-55B 


54 


5 


9.1 


0.10 


t A 
1.0 


++ 


+ 


564-47A 




t 

A 


4 8 


0.06 


A 9 
U.O 






564-35A 


37 


7 


13 


0.18 


0.3 


+ 


++ 


564-60D 


61 


3 


4.5 


0.06 


0.2 


+ 


++ 


564-45A 


46 


4 


8.1 


0.09 


0.4 


+ 


+ 


564-44B 


50 


1 


1.7 


0.02 


0.1 


+/- 




564-49A 


53 


1 


1.0 


0.02 


0 


+/- 




564-39C 


62 


1 


1.4 


0.02 


0 







Free amino acids were extracted from mature seeds derived from self- 
pollinated plants and quantitated as described in Example 8. There was no 
significant difference in the free amino acid content of seeds from untransformed 
plants compared to that from the plants showing the highest free lysine 
accumulation in leaves, i.e. plants 564-18A, 564-21A, 564-36E, 564-56A. 

EXAMPLE 12 

Transformation of Tobacco with th e Phaseoli n Promoter/cts /ecOdapA PWS 
Phaseolin Prrimoter/cts/lvsC-M4 Chimeric Genes 
The chimeric gene cassettes, phaseolin 5' region/cts/ecodapA^haseolin 3' 
region and phaseolin 5' region/cts AysC -M4/phaseolin 3' (Example 6) were 
combined in the binary vector pZS97 (Figure 6). The binary vector is described in 
Example 8. To accomplish this the phaseolin 5' region/cts/ecoda^Ayphaseolin 3' 
chimeric gene was isolated as a 2.7 kb Hind m fragment and inserted into the 
Hind m site of vector pUC1318 [Kay et al (1987) Nucleic Acids Res. 6:2778], 
yielding pBT568. It was then possible to digest pBT568 with BamH I and isolate 
the chimeric gene on a 2.7 kb BamH I fragment. This fragment was inserted into 
BamH I digested pBT549 (Example 8), yielding pBT570. This binary vector has 
both chimeric genes, phaseolin 5' region/cts/ecod^A^phaseolin 3' gene and 
phaseolin 5' region/cts/lyjsQ-M4/phaseolin 3' inserted in the same orientation. 
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The binary vector pBT570 was transferred by tri-parental mating to 
Agrobacterium strain LB A4404/p AL4404 , the Agrobacterium transform ants used 
to inoculate tobacco leaf disks and the resulting transgenic plants regenerated by 
the methods set out in Example 7. 
5 To assay for expression of the chimeric genes in the seeds of the transformed 

plants, the plants were allowed to flower, self-pollinate and go to seed Total 
proteins were extracted from mature seeds and analyzed via western blots as 
described in Example 8. 

Twenty-one of twenty-five transform ants expressed the DHDPS protein and 

10 nineteen of these also expressed the AKHI protein (Table 7). The amounts of the 
proteins expressed were related to the number of gene copies present in the 
transformants; the highest expressing lines, 570-4B, 570- 12C, 570-59B and 
570-23B, all had two or more sites of insertion of the gene cassette based on 
segregation of the kanamycin marker gene. Enzymatically active E. coli DHDPS 

15 was observed in mature seeds of all the lines tested wherein the protein was 
detected. 

To measure free amino acid composition of the seeds, free amino acids were 
extracted from mature seeds and analyzed as described in Example 8. There was a 
good correlation between transformants expressing higher levels of both DHDPS 

20 and AKIH protein and those having higher levels of free lysine and threonine. The 
highest expressing lines (marked by asterisk in Table 7) showed up to a 2-fold 
increase in free lysine levels and up to a 4-fold increase in the level of free 
threonine in the seeds. 

In the highest expressing lines it was possible to detect a high level of 

25 cc-aminoadipic acid. This compound is known to be an intermediate in the 

catabolism of lysine in cereal seeds, but is normally detected only via radioactive 
tracer experiments due to its low level of accumulation. The build-up of high 
levels of this intermediate indicates that a large amount of lysine is being produced 
in the seeds of these transformed lines and is passing through the catabolic 

30 pathway. The build-up of a-aminoadipic acid was not observed in transformants 
expressing only E. coli DHDPS or only AKIII-M4 in seeds. These results show 
that it is necessary to express both enzymes simultaneously to produce high levels 
of free lysine. 
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LINE 



IABLEJZ 

BT570 Transform ants: phaseolin 5'repon/cts/ lvsC -M4/phaseolin 3' region 
phaseolin S'region/cts fe^nHapA^ pbase tin 3' region 



FREE AMINO TOTAL AMINO WESTERN WESTERN £. CQ11 
ACIDS/SEED ACIDS/SEED EQ2LL KCQLL CUBES Progeny 
K/L T/L K/L T/L DHDPS AKm U/MO/HR Kar/tKan 8 



3 

ru 
hi 



01 

01 



fu 



NORMAL 

570-4B 

570-7C 

570-8B 

570-12C* 

570-18A 

570-24A 

570-37A 

570-44A 

570-46B 

570-51B 

570-59B* 

570-80A 

570-11 A 

570-17B 

570-20A 

570-21B 

570-23B* 

570-25D 

570-26A 

570-32A 

570-35A 

570-38A-1 

570-38A-3 

570-42A 

570-45A 

* indicates 



0.49 1.3 

0.31 2.6 

0.39 2.3 

0.29 2.1 

0.64 5.1 

0.33 3.0 

0.33 2.0 

0.33 2.1 

0.29 2.1 

0.41 2.1 

0.33 1.5 

0.46 3.0 

0.31 2.2 

0.28 2.3 

0.27 1.6 

0.41 2.3 

0.26 2.4 

0.40 3.6 

0.30 2.3 

0.28 1.5 

0.25 2.5 

0.25 2.5 

0.25 2.6 

0.33 1.6 

0.27 2.5 

0.60 3.4 
free amino acid sample 



0.35 0.68 

0.34 0.64 

0.34 0.64 

0.34 0.63 

0.36 0.68 

0.35 0.65 

0.34 0.65 

0.34 0.64 

0.34 0.64 

0.35 0.65 

0.33 0.64 

0.35 0.65 

0.34 0.64 

0.34 0.67 

0.34 0.65 

0.35 0.67 

0.34 0.68 

0.34 0.68 

0.35 0.66 

0.34 0.64 

0.34 0.67 

0.34 0.63 

0.34 0.64 

0.35 0.63 

0.34 0.62 

0.39 0.64 
has a-aminoadipic acid 



>4.3 



+/- 
+ 
+ 



0 

2.6 



+ 
+ 



3.1 



15:1 



>15:1 
15:1 



>15:1 



3:1 



63:1 



3:1 
3:1 

3:1 
3:1 
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EXAMPLE 13 
Use of the cts/lvsC-M4 Chimeric Gene as a Selectable 
Mafcer for Tobac co Transforation 

The 35S promoter/Cab leader/cts/ly£-M4/Nos 3* chimeric gene in the binary 
vector pZS97K (pBT542, see Example 7) was used as a selectable genetic marker 
for transformation of tobacco. High concentrations of lysine plus threonine inhibit 
growth of shoots from tobacco leaf disks. Expression of active lysine and 
threonine insensitive AKHI-M4 reverses this growth inhibition (see Example 7). 

The binary vector pBT542 was transferred by tri-parental mating to 
Agrobacterium strain LBA4404/pAL4404, the Agrobacterium transform ants used 
to inoculate tobacco leaf disks and the resulting transformed shoots were selected 
on shooting medium containing 3mM lysine plus 3mM threonine. Shoots were 
transferred to rooting media containing 3mM lysine plus 3mM threonine. Plants 
were grown from the rooted shoots. Leaf disks from the plants were placed on 
shooting medium containing 3mM lysine plus 3mM threonine. Transformed plants 
were identified by the shoot proliferation which occurred around the leaf disks on 
this medium. 

EXAMPLE 14 

Transformation of Tobacco with the 35S Prom oter/cts/cordanA Chimeric Gene 

The 35S promoter/Cab leader/cts/cordapA/Nos 3' chimeric gene was isolated 
as a 3.0 kb BamH I-Sal I fragment and inserted into BamH I-Sal I digested binary 
vector pZS97K (Figure 5), yielding plasmid pFS852. The binary vector is 
described in Example 7. 

The binary vector containing the chimeric co rdapA gene was transferred by 
tri-parental mating to Agrobacterium strain LBA4404/pAL4404, the 
Agrobacterium transformant was used to inoculate tobacco leaf disks and the 
resulting transgenic plants were regenerated by the methods set out in Example 7. 

To assay for expression of the chimeric gene in leaves of the transformed 
plants, protein was extracted as described in Example 7, with the following 
modifications. The supernatant from the first ammonium sulfate precipitation, 
approximately 18mL, was mixed with an additional 12mL of cold, saturated 
ammonium sulfate. The mixture was set on ice for 30 min and centrifuged as 
described in Example 7. The supernatant was decanted and the pellet, which 
contained the DHDPS protein, was resuspended in 1 mL of TNE and desalted by 
passage over a Sephadex G-25 M column (Column PD-10, Pharmacia). 

The leaf extracts were assayed for DHDPS protein and enzyme activity as 
described in Example 4. Corynebacteria DHDPS enzyme activity could be 
distinguished from tobacco DHDPS activity by its insensitivity to lysine inhibition. 
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Eight of eleven transformants showed Corynebacteria DHDPS expression, both as 
protein detected via western blot and as active enzyme. 

Free amino acids were extracted from leaves as described in Example 7. 
Expression of Corynebacteria DHDPS resulted in large increases in the level of 
free lysine in the leaves (Table 8). However, there was not a good correlation 
between the level of expression of DHDPS and the amount of free lysine 
accumulated. Free lysine levels from 2 to 50-fold higher than untransformed 
tobacco were observed. There was also a 2 to 2.5-fold increase in the level of total 
leaf lysine in the lines that showed high levels of free lysine. 

TABLE 8 

FS586 transformants: 35S promoter/Cab leader/cts/conJ^A/Nos 3' 

FREE AMINO TOTAL AMINO WESTERN CORYNE. 



LINE 


ACIDS/LEAF 
K/L 


ACIDS/LEAF 
K/L 


CORYNE. 
DHDPS 


DHDPS 
U/MG/HI 


NORMAL 


0.5 


0.8 






FS586-2A 


1.0 


0.8 






FS586-4A 


0.9 


0.8 


+ 


6.1 


FS586-11B 


3.6 


0.8 


+ 


3.4 


FS586-HD 


26 


2.0 


+ 


3.5 


FS586-13A 


2.4 


0.8 


+ 


3.5 


FS586-19C 


5.1 


0.8 


+ 


3.1 


FS586-22B 


>15 


1.5 


+ 


2.3 


FS586-30B 




0.8 






FS586-38B 


18 


1.5 


++ 


3.9 


FS586-51A 


1.3 


0.8 






FS586-58C 


1.2 


0.8 


+ 


5.1 



The plants were allowed to flower, self-pollinate and go to seed. Mature 
seed was harvested and assayed for free amino acid composition as described in 
Example 8. There was no difference in the free lysine content of the transformants 
compared to untransformed tobacco seed. 

EXAMPLE 15 

Transformation of Tobacco with the KTI3 promoter/cts/cordapA or 
Phaseolin Promoter/cts/cordapA plus 
Phaseolin Promote r/cts/lvsC-M4 Chimeric Qgngff 
The chimeric gene cassettes, KTO 5* region/cts/ cordacA/KTD 3* region and 
phaseolin 5' region/cts/ !ygQ-M4/phaseolin 3' as well as phaseolin 5' region/cts/ 
co rdapA/ phaseolin 3' region and phaseolin 5' region/cts/ bs£-M4/phaseolin 3' 
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(Example 6) were combined in the binary vector pZS97 (Figure 6). The binary 
vector is described in Example 8. 

To accomplish this the KTI3 5' region/cts/cordagA/ KT13 3' region chimeric 
gene cassette was isolated as a 3.3 kb BamH I fragment and inserted into BamH I 
digested pBT549 (Example 8), yielding pFS883. This binary vector has the 
chimeric genes, KTD 5' region/cts/cor dapA/ KTI3 3' region and phaseolin 5* 
region/cts/ly£-M4/phaseolin 3' region inserted in opposite orientations. 

The phaseolin 5 1 region/cts/cor dapA/ phaseolin 3'region chimeric gene 
cassette was modified using oligonucleotide adaptors to convert the Hind HI sites 
at each end to BamH I sites. The gene cassette was then isolated as a 2.7 kb 
BamH I fragment and inserted into BamH I digested pBT549 (Example 8), 
yielding pFS903. This binary vector has both chimeric genes, phaseolin 5* 
region/cts/co rdapAfo haseolin 3* region and phaseolin 5' 
region/cts flysC -M4/phaseolin 3* region inserted in the same orientation. 

The binary vectors pFS883 and pFS903 were transferred by tri-parental 
mating to Agrobacterium strain LBA4404/pAL4404, the Agrobacterium 
transform ants were used to inoculate tobacco leaf disks and the resulting 
transgenic plants were regenerated by the methods set out in Example 7. 

To assay for expression of the chimeric genes in the seeds of the transformed 
plants, the plants were allowed to flower, self-pollinate and go to seed. Total 
proteins were extracted from mature seeds and analyzed via western blots as 
described in Example 8. 

Twenty-one of twenty-two transform ants tested expressed the DHDPS 
protein and eighteen of these also expressed the AKIII protein (Table 8). 
Enzymatically active Corynebacteria DHDPS was observed in mature seeds of all 
the lines tested wherein the protein was detected except one. 

To measure free amino acid composition of the seeds, free amino acids were 
extracted from mature seeds and analyzed as described in Example 8. There was a 
good correlation between transformants expressing higher levels of both DHDPS 
and AKHI protein and those having higher levels of free lysine and threonine. The 
highest expressing lines showed up to a 3-fold increase in free lysine levels and up 
to a 8-fold increase in the level of free threonine in the seeds. As was described in 
Example 12, a high level of cc-aminoadipic acid, indicative of lysine catabolism, 
was observed in many of the transformed lines (indicated by asterisk in Table 9). 
There was no major difference in the free amino acid composition or level of 
protein expression between the transformants which had the KTD or Phaseolin 
regulatory sequences driving expression of the Corynebacteria DHDPS gene. 
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TABLED 

FS883 Transfonnants: phaseolin 5' iegioa/cts/TysC-M4/phaseolin 3* 
KTI3 5' region/cts/cordapA/KTI3 3' 

FS903 Transfonnants: phaseolin 5' region/cts/bsC-M4/phascolin 3' 
phaseolin 5 r rcgion/cts/cordsr^i>haseolin 3' 

FREE AMINO WESTERN WESTERN CORYNE. 
ACEDS/SEED CORYNE. ECQLl DHDPS 



LINE 


K/L 


T/L 


DHDPS AKIH 


U/MG/1 


NORMAL 


0.5 


1.3 


- 




FS883-4A 


0.9 


4.0 


+ + 




FS883-11A 


1.0 


3.5 


++ ++ 


3.1 


FS883-14B 


0.5 


2.5 


++ ++ 




FS883-16A* 


0.7 


10.5 


+ +++ 


0 


FS883-17A* 


1.0 


5.0 


+++ +++ 


7.0 


FS883-18C* 


1.2 


3.5 


++ + 


5.8 


FS883-21A 


0.5 


1.5 


+ +/- 




FS883-26B* 


1.1 


3.6 


++ ++ 


2.4 


FS883-29B 


0.5 


1.5 


+ 


0.4 


FS883-32B 


0.7 


2.4 


++ + 


1.5 


FS883-38B* 


1.1 


11.3 


+ ++ 


ZO 


FS883-59C* 


1.4 


6.1 


+ + 


0.5 


FS903-3C 


0.5 


1.8 


+ +++ 




FS903-8A* 


0.8 


2.1 


+++ MM 




FS903-9B 


0.6 


1.8 


++ ++ 


4.3 


FS903-10A 


0.5 


1.5 


- 




FS903-22F 


0.5 


1.8 


++ ++ 


0.9 


FS903-35B* 


0.8 


2.1 


++ ++ 




FS903-36B 


0.7 


1-5 


+ 




FS903-40A 


0.6 


1.8 


+ + 




FS903-41A* 


1.2 


2.0 


++ +++ 




FS903-42A 


0.7 


2.2 


++ +++ 


5.4 


FS903-44C 


0.5 


1.9 






FS903-53B 


0.6 


1.9 






* indicates free amino acid sample hs 


is a-aminoadipic acid 





Progeny 
Kan^KanS 



>15:1 
3:1 



3:1 



3:1 



15:1 



Free amino acid composition and expression of bacterial DHDPS and AKIH 
proteins was also analyzed in developing seeds of two lines that segregated as 
single gene cassette insertions (see Table 10). Expression of the DHDPS protein 
under control of the KTI3 promoter was detected at earlier times than that of the 
AKIH protein under control of the Phaseolin promoter, as expected. At 14 days 
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after flowering both proteins were expressed at a high level and there was about an 
8-fold increase in the level of free lysine compared to normal seeds. These results 
confirm that simultaneous expression of lysine insensitive DHDPS and lysine- 
insensitive AK results in the production of high levels of free lysine in seeds. Free 
lysine does not continue to accumulate to even higher levels, however. In mature 
seeds free lysine is at a level 2 to 3-fold higher than in normal mature seeds, and 
the lysine breakdown product ct-aminoadipic acid accumulates. These results 
provide further evidence that lysine catabolism occurs in seeds and prevents 
accumulation of the high levels of free lysine produced in transform ants expressing 
lysine insensitive DHDPS and lysine insensitive AK. 

TABLE 10 

Developing seeds of FS883 Transform ants: 

phaseolin 5* region/cts/bsC-M4/phaseolin 3' region 
KTI3 5' iegion/cts/coxdgpA/KTI3 3' region 

FREE AMINO WESTERN WESTERN 





DAYS AFTER 


ACIDS/SEED 


CQRYNE» 




LINE 


FLOWERING 


K/L 


T/L 


DHDPS 


AKm 


FS883-18C 


9 


1.1 


2.1 






FS883-18C 


10 


1.4 


3.3 


+/- 




FS883-18C 


11 


1.4 


2.5 


+ 




FS883-18C 


14 


4.3 


1.0 


++ 


++ 


FS883-18C* 


MATURE 


1.2 


3.5 


+++ 


++ 


FS883-32B 


9 


1.3 


2.9 


+ 




FS883-32B 


10 


1.6 


2.7 


+ 




FS883-32B 


11 


1.4 


2.3 


+ 




FS883-32B* 


14 


3.9 


1.3 


++ 


++ 


FS883-32B* 


MATURE 


0.7 


2.4 


+++ 


++ 



* indicates free amino acid sample has cc-aminoadipic acid 

EXAMPLE 1$ 

Transformation of Oilseed Rape with the Phaseolin Promo ter/cts/cordapA and 
Pha scplin Prpmoter/cts/lysC-M4 Qiimew Qcn$$ 
The chimeric gene cassettes, phaseolin 5' region/ cts/cordapA^phaseolin 3' 
region, phaseolin 5* region/ ctsflysC-M4/phaseolin 3', and phaseolin 5' region/ 
cts/cordagA/phaseolin 3* region plus phaseolin 5* rcgion/cts flysC -M4frhaseolin 3' 
(Example 6) were inserted into the binary vector pZS199 (Figure 7 A), which is 
similar to pSZ97K described in Example 8. In pZS199 the 35S promoter from 
Cauliflower Mosaic Virus replaced the nos promoter driving expression of the 
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NPT II to provide better expression of the marker gene, and the orientation of the 
polylinker containing the multiple restriction endonuclease sites was reversed. 

To insert the phaseolin 5' region/cts/cor dapA/ phaseolin 3'region 3', the gene 
cassette was isolated as a 2.7 kb BamH I fragment (as described in Example 15) 
and inserted into BamH I digested pZS199, yielding plasmid pFS926 (Figure 7B). 
This binary vector has the chimeric gene, phaseolin 5' region/cts/cordggA/phaseolin 
3' region inserted in the same orientation as the 35S/NPT UJnos 3 f marker gene. 

To insert the phaseolin 5' region7cts /lysC -M4/phaseolin 3' region, the gene 
cassette was isolated as a 3.3 kb EcoR I to Spe I fragment and inserted into EcoR I 
plus Xba I digested pZS199, yielding plasmid pBT593 (Figure 7C). This binary 
vector has the chimeric gene, phaseolin 5' regioiVcts/ly^-M4^haseolin 3' region 
inserted in the same orientation as the 35S/NPT D/nos 3* marker gene. 

To combine the two cassettes, the EcoR I site of pBT593 was converted to a 
BamH I site using oligonucleotide adaptors, the resulting vector was cut with 
BamH I and the phaseolin 5' regioiVcts/cordaEA/ phaseolin 3* region gene cassette 
was isolated as a 2.7 kb BamH I fragment and inserted, yielding pBT597 (Figure 
7D), This binary vector has both chimeric genes, phaseolin 5 f 
region/cts/cordaj^/phaseolin 3' region and phaseolin 5' region/cts /lysC - 
M4/phaseolin 3* region inserted in the same orientation as the 35S/NPT H/nos 3' 
marker gene. 

Brassica napus cultivar "Westar" was transformed by co-cultivation of 
seedling pieces with disarmed Agrobacterium tumefaciens strain LBA4404 
carrying the the appropriate binary vector. 

B. napus seeds were sterilized by stirring in 10% Chlorox, 0.1% SDS for 
thirty min, and then rinsed thoroughly with sterile distilled water. The seeds were 
germinated on sterile medium containing 30 mM CaC12 and 1.5% agar, and grown 
for 6 d in the dark at 24°C. 

Liquid cultures of Agrobacterium for plant transformation were grown 
overnight at 28°C in Minimal A medium containing 100 mg/L kanamycin. The 
bacterial cells were pelleted by centrifugation and resuspended at a concentration 
of 10** cellsAnL in liquid Murashige and Skoog Minimal Organic medium 
containing 100 uM acetosyringone. 

B. napus seedling hypocotyls were cut into 5 mm segments which were 
immediately placed into the bacterial suspension. After 30 min, the hypocotyl 
pieces were removed from the bacterial suspension and placed onto BC-35 callus 
medium containing 100 uM acetosyringone. The plant tissue and Agrobacteria 
were co-cultivated for 3 d at 24°C in dim light. 
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The co-cultivation was terminated by transferring the hypocotyl pieces to 
BC-35 callus medium containing 200 mg/L carbenicillin to kill the Agrobacteria, 
and 25 mg/L kanamycin to select for transformed plant cell growth. The seedling 
pieces were incubated on this medium for three weeks at 24°C under continuous 
light. 

After three weeks, the segments were transferred to BS-48 regeneration 
medium containing 200 mg/L carbenicillin and 25 mg/L kanamycin. Plant tissue 
was subcultured every two weeks onto fresh selective regeneration medium, under 
the same culture conditions described for the callus medium. Putatively 
transformed calli grew rapidly on regeneration medium; as calli reached a diameter 
of about 2mm, they were removed from the hypocotyl pieces and placed on the 
same medium lacking kanamycin 

Shoots began to appear within several weeks after transfer to BS-48 
regeneration medium. As soon as the shoots formed discemable stems, they were 
excised from the calli, transferred to MSV-1A elongation medium, and moved to a 
16:8-h photoperiod at 24°C. 

Once shoots had elongated several intemodes, they were cut above the agar 
surface and the cut ends were dipped in Rootone. Treated shoots were planted 
directly into wet Metro-Mix 350 soiless potting medium. The pots were covered 
with plastic bags which were removed when the plants were clearly growing, after 
about 10 days. Results of the transformation are shown in Table 11. Transformed 
plants were obtained with each of the binary vectors. 

Minimal A Paraxial Qrowth Mgflwn 
Dissolve in distilled water: 

10.5 g potassium phosphate, dibasic 

4.5 g potassium phosphate, monobasic 

1.0 g ammonium sulfate 

0.5 g sodium citrate, dihydrate 
Make up to 979 mL with distilled water 
Autoclave 

Add 20 mL filter-sterilized 10% sucrose 
Add 1 mL filter-sterilized 1 M MgS04 

Pr assica Callus Medium PC-35 

Per liter: 

Murashige and Skoog Minimal Organic Medium 

(MS salts, 100 mg/L i-inositol, 0.4 mg/L thiamine; GIBCO #5 10-31 18) 
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30 g sucrose 
18 g mannitol 
0.5 mg/L 2,4-D 
0.3 mg/L kinetin 
0.6% agarose 
pH5.8 

Brassica Regeneration Medium BS^8 

Murashige and Skoog Minimal Organic Medium 

Gamborg B5 Vitamins (SIGMA #1019) 

10 g glucose 

250 mg xylose 

600mgMES 

0.4% agarose 

pH5.7 

Hlter-sterilize and add after autoclaving: 
2.0 mg/L zeatin 
0.1 mg/LIAA 

Biasaga Shoot Elongation Medium MSY-1 A 

Murashige and Skoog Minimal Organic Medium 

Gamborg B5 Vitamins 

10 g sucrose 

0.6% agarose 

pH5.8 

IABLEJ1 
Canola transform ants 



NUMBER OF 



BINARY 
VECTOR 


NUMBER OF 
CUT ENDS 


NUMBER OF 
KAN R CALLI 


SHOOTING 
CALLI 


NUMBER OF 
PLANTS 


pZS199 


120 


41 


5 


2 


pFS926 


600 


278 


52 


28 


pBT593 


600 


70 


10 


3 


pBT597 


600 


223 


40 


23 



Plants were grown under a 16:8-h photoperiod, with a daytime temperature 
of 23°C and a nightime temperature of 17°C. When the primary flowering stem 
began to elongate, it was covered with a mesh pollen-containment bag to prevent 
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outcrossing. Self-pollination was facilitated by shaking the plants several times 
each day. Mature seeds derived from self-pollinations were harvested about three 
months after planting. 

A partially defatted seed meal was prepared as follows: 40 mg of mature dry 
seed was ground with a mortar and pestle under liquid nitrogen to a fine powder. 
One milliliter of hexane was added and the mixture was shaken at room 
temperature for 15 min. The meal was pelleted in an eppendorf centrifuge, the 
hexane was removed and the hexane extraction was repeated. Then the meal was 
dried at 65° for 10 min until the hexane was completely evaporated leaving a dry 
powder. Total proteins were extracted from mature seeds as follows. 
Approximately 30-40 mg of seeds were put into a 1.5 mL disposable plastic 
microfuge tube and ground in 0.25 mL of 50 mM Tris-HQ pH 6.8, 2 mM EDTA, 
1% SDS, 1% P-mercaptoethanol. The grinding was done using a motorized 
grinder with disposable plastic shafts designed to fit into the microfuge tube. The 
resultant suspensions were centrifuged for 5 min at room temperature in a 
microfuge to remove particulates. Three volumes of extract was mixed with 1 
volume of 4 X SDS-gel sample buffer (0.17M Tris-HQ pH6.8, 6.7% SDS, 16.7% 
(3-mercaptoethanol, 33% glycerol) and 5 \iL from each extract were run per lane 
on an SDS polyacrylamide gel, with bacterially produced DHDPS or AKHI serving 
as a size standard and protein extracted from untransformed tobacco seeds serving 
as a negative control. The proteins were then electrophoretically blotted onto a 
nitrocellulose membrane. The membranes were exposed to the DHDPS or AKHI 
antibodies at a 1:5000 dilution of the rabbit serum using standard protocol 
provided by BioRad with their Immun-Blot Kit. Following rinsing to remove 
unbound primary antibody the membranes were exposed to the secondary 
antibody, donkey anti-rabbit Ig conjugated to horseradish peroxidase (Amersham) 
at a 1:3000 dilution. Following rinsing to remove unbound secondary antibody, 
the membranes were exposed to Amersham chemiluminescence reagent and X-ray 
film. 

Eight of eight FS926 transformants and seven of seven BT597 transfonnants 
expressed the DHDPS protein. The single BT593 transformant and five of seven 
BT597 transformants expressed the AKDI-M4 protein (Table 12). Thus it is 
straightforward to express these proteins in oilseed rape seeds. 

To measure free amino acid composition of the seeds, free amino acids were 
extracted from 40 mg of the defatted meal in 0.6 mL of methanol/chloroform/water 
mixed in ratio of 12v/5v/3v (MCW) at room temperature. The mixture was 
vortexed and then centrifuged in an eppendorf microcentrifuge for about 3 min. 
Approximately 0.6 mL of supernatant was decanted and an additional 0.2 mL of 
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MCW was added to the pellet which was then vortexed and centrifuged as above. 
The second supernatant, about 0.2 mL, was added to the first. To this, 0.2 mL of 
chloroform was added followed by 0.3 mL of water. The mixture was vortexed 
and then centrifuged in an eppendorf microcentrifuge for about 3 min, the upper 
aqueous phase, approximately 1.0 mL, was removed, and was dried down in a 
Savant Speed Vac Concentrator. The samples were hydrolyzed in 6N hydrochloric 
acid, 0.4% (5-mercaptoethanoi under nitrogen for 24 h at 1 10-120°C; 1/4 of the 
sample was run on a Beckman Model 6300 amino acid analyzer using post-column 
ninhydrin detection. Relative free amino acid levels in the seeds were compared as 
ratios of lysine or threonine to leucine, thus using leucine as an internal standard 

In contrast to tobacco seeds, expression of Corynebacterium DHDPS lead to 
large increases in accumulation of free lysine in rapeseed transform ants. The 
highest expressing lines showed a greater than 100-fold increase in free lysine level 
in the seeds. The transform ant that expressed AKQI-M4 in the absence of 
Corynebacteria DHDPS showed a 5-fold increase in the level of free threonine in 
the seeds. Concomitant expression of both enzymes resulted in accumulation of 
high levels of free lysine, but not threonine. 

A high level of oc-aminoadipic acid, indicative of lysine catabolism, was 
observed in many of the transformed lines. Thus, prevention of lysine catabolism 
by inactivation of lysine ketoglutarate reductase should further increase the 
accumulation of free lysine in the seeds. Alternatively, incorporation of lysine into 
a peptide or lysine-rich protein would prevent catabolism and lead to an increase in 
the accumulation of lysine in the seeds. 

To measure the total amino acid composition of mature seeds, 2 mg of the 
defatted meal were hydrolyzed in 6N hydrochloric acid, 0.4% p-mercaptoethanol 
under nitrogen for 24 h at 1 10-120°C; 1/100 of the sample was run on a Beckman 
Model 6300 amino acid analyzer using post-column ninhydrin detection. Relative 
amino acid levels in the seeds were compared as percentages of lysine, threonine or 
oc-aminoadipic acid to total amino acids. 

There was a good correlation between expression of DHDPS protein and 
accumulation of high levels of lysine in the seeds of transformants. Seeds with a 
5-100% increase in the lysine level, compared to the untransformed control, were 
observed. In the transformant with the highest level, lysine makes up about 13% of 
the total seed amino acids, considerably higher than any previously known 
rapeseed seed. This transformant expresses high levels of both E. coli AKDI-M4 
and Corynebacterium DHDPS. 
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IABL&I2 

FS926 Transform ants: phaseolin 5' region/cts/cordapA/phaseolin 3 1 
BT593 Transform ants: phaseolin 5* regioD/cts /lysC -M4/phaseolin 3' 
BT597 Transform ants: phaseolin 5* region/cts/bsC-M4^>haseolin 3' 
phaseolin 5' region/cts/coid^pA/phaseolin 3' 

WESTERN WESTERN % TOTAL AMINO 
FREE AMINO ACIDS CORYNE. £.££21/ ACIDS 



LINE 


K/L T/L AA/L 


DHDPS 


AKQI-M4 


K T AA 


WESTAR 


0.8 2.0 


0 






6.5 5.6 0 


ZS199 


1.3 3.2 


0 






6.3 5.4 0 




140 2.0 


16 


iiii 
TTTT 




H CI t A 


FS926-9 


110 1.7 


12 


nil 




11 5.0 0.8 


FS926-11 


7.9 2.0 


5.2 


++ 




7.7 5.2 0 


FS926-6 


14 1.8 


4.6 






8.2 5.9 0 


FS926-22 


3.1 1.3 


0.3 


+ 




6.9 5.7 0 


FS926-27 


4.2 1.9 


1.1 


++ 




7.1 5.6 0 


PS926-29 


38 1.8 


4.7 


++++ 




12 5.2 1.6 


FS926-68 


4.2 1.8 


0.9 


4-4- 
TT 




14 < < A 

O.J V 


BT593-42 


1.4 11 


0 




++ 


6.3 6.0 0 


BT597-14 


6.0 2.6 


4.3 


++ 


+/- 


7.0 5.3 0 


BT597-145 


1.3 2.9 


0 


+ 






BT597-4 


38 3.7 


4.5 


++++ 


++++ 


13 5.6 1.6 


BT597-68 


4.7 2.7 


1.5 


++ 


+ 


6.9 5.8 0 


BT597-100 


91 1.9 


1.7 


+++ 


++ 


6.6 5.7 0 


BT597-148 


7.6 2.3 


0.9 


+++ 


+ 


7.3 5.7 0 


BT597-169 


5.6 2.6 


1.7 


+++ 


+++ 


6.6 5.7 0 



AA is a-amino adipic acid 

EXAMPLE 17 
Tyan^fonn^tion <?f Maize Using a Chimeri c lysC-M4 Cten<? 
3$ a Selectee Mwkc r 

Embryogcnic callus cultures were initiated from immature embryos (about 
1 .0 to 1 .5 mm) dissected from kernels of a com line bred for giving a "type II 
callus" tissue culture response. The embryos were dissected 10 to 12 d after 
pollination and were placed with the axis-side down and in contact with agarose - 
solidified N6 medium [Chu et al. (1974) Sci Sin 75:659-668] supplemented with 
0.5 mg/L 2,4-D (N6-0.5). The embryos were kept in the dark at 27°C. Friable 
embiyogenic callus consisting of undifferentiated masses of cells with somatic 
proembryos and somatic embryos borne on suspensor structures proliferated from 
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the scutellum of the immature embryos. Clonal embryogenic calli isolated from 
individual embryos were identified and sub-cultured on N6-0.5 medium every 2 to 
3 weeks. 

The particle bombardment method was used to transfer genes to the callus 
culture cells. A Biolistic™ PDS-1000/He (BioRAD Laboratories, Hercules, CA) 
was used for these experiments. 

The plasmid pBT573, containing the chimeric gene HH534 5' region/ 
mcts/lysC-M4/HH2-l 3' region (see Example 6) designed for constitutive gene 
expression in corn, was precipitated onto the surface of gold particles. To 
accomplish this 2.5 \ig of pBT573 (in water at a concentration of about 1 mg/mL) 
was added to 25 mL of gold particles (average diameter of 1.5 pm) suspended in 
water (60 mg of gold per mL). Calcium chloride (25 mL of a 2.5 M solution) and 
spermidine (10 mL of a 1.0 M solution) were then added to the gold-DNA 
suspension as the tube was vortexing. The gold particles were centrifuged in a 
microfuge for 10 s and the supernatant removed. The gold particles were then 
resuspended in 200 mL of absolute ethanol, were centrifuged again and the 
supernatant removed. Finally, the gold particles were resuspended in 25 mL of 
absolute ethanol and sonicated twice for one sec. Five pL of the DNA-coated gold 
particles were then loaded on each macro carrier disk and the ethanol was allowed 
to evaporate away leaving the DNA-covered gold particles dried onto the disk. 

Embryogenic callus (from the callus line designated #132.2.2) was arranged 
in a circular area of about 6 cm in diameter in the center of a 100 X 20 mm petri 
dish containing N6-0.5 medium supplemented with 0.25M sorbitol and 0.25M 
mannitol. The tissue was placed on this medium for 2 h prior to bombardment as a 
pretreatment and remained on the medium during the bombardment procedure. At 
the end of the 2 h pretreatment period, the petri dish containing the tissue was 
placed in the chamber of the PDS-1000/He. The air in the chamber was then 
evacuated to a vacuum of 28 inch of Hg. The macrocarrier was accelerated with a 
helium shock wave using a rupture membrane that bursts when the He pressure in 
the shock tube reaches 1 100 psi. The tissue was placed approximately 8 cm from 
the stopping screen. Four plates of tissue were bombarded with the DNA-coated 
gold particles. Immediately following bombardment, the callus tissue was 
transferred to N6-0.5 medium without supplemental sorbitol or mannitol. 

Seven d after bombardment small (2-4 mM diameter) clumps of callus tissue 
were transferred to N6-0.5 medium lacking casein or proline, but supplemented 
with 2mM each of lysine and threonine (LT). The tissue continued to grow slowly 
on this medium and was transferred to fresh N6-0.5 medium supplemented with LT 
every 2 weeks. After 12 weeks two clones of actively growing callus was 



85 



identified on two separate plates containing LT-supplemented medium. These 
clones continued to grow when sub-cultured on the selective medium. The 
presence of the lysC -M4 gene in the selected clones was confirmed by PCR 
analysis. Callus was transferred to medium that promotes plant regeneration. 
5 EXAMPLE 18 

Transformation of Corn with the 
Constitutive Corn Promoter/cts/ecodapA and 
Constitutive Com Promoter/cts/lvsC-M4 
The chimeric gene cassettes, HH534 5* region/ mcts/ecodie£/HH2-l 3' 
. 10 region plus HH534 5' region/ mcts/lysC-M4/HH2-l 3' region, (Example 6) were 

inserted into the vector pGem9z to generate a com transformation vector. Plasmid 
pBT583 (Example 6) was digested with Sal I and an 1850 bp fragment containing 
p the HH534 5' region/mcts/ecodapA/HH2-l 3' region gene cassette was isolated. 

O This DNA fragment was inserted into pBT573 (Example 6), which carries the 

S H 15 HH534 5' region/mcts/ lvsC-M4/HH2-l 3' region, digested with Xho L The 
O resulting vector with both chimeric genes in the same orientation was designated 

pBT586. 

Vector pBT586 was introduced into embryogenic corn callus tissue using the 
particle bombardment method. The establishment of the embryogenic callus 
20 cultures and the parameters for particle bombardment were as described in 
Example 17. 

Either one of two plasmid vectors containing selectable markers were used in 
the transformations. One plasmid, pALSLUC [Fromm et al. (1990) Biotechnology 
5:833-839], contained a cDNA of the maize acetolactate synthase (ALS) gene. 
25 The ALS cDNA had been mutated in vitro so that the enzyme coded by the gene 
would be resistant to chlorsulfuron. This plasmid also contains a gene that uses the 
35S promoter from Cauliflower Mosaic Virus and the 3* region of the nopaline 
synthase gene to express a firefly luciferase coding region [de Wet et al. (1987) 
Molec. Cell Biol. 7:725-737]. The other plasmid, pDETRIC, contained the bar 
30 gene from Streptomvces hvgroscopicus that confers resistance to the herbicide 
glufosinate [Thompson et al. (1987 The EMBO Journal 6:2519-2523]. The 
bacterial gene had its translation codon changed from GTG to ATG for proper 
translation initiation in plants [De Block et al. (1987) The EMBO Journal 
6:2513-2518]. The bar gene was driven by the 35S promoter from Cauliflower 
35 Mosaic Virus and uses the termination and polyadenylation signal from the 
octopine synthase gene from Agrobacterium tumefaciens. 

For bombardment, 2.5 pg of each plasmid, pBT586 and one of the two 
selectable marker plasmids, was co-precipitated onto the surface of gold particles 
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as described in Example 17. Bombardment of the embryogenic tissue cultures was 
also as described in Example 17. 

Seven days after bombardment the tissue was transferred to selective 
medium. The tissue bombarded with the selectable marker pALSLUC was 
5 transferred to N6-0.5 medium that contained chlorsulfuron (30 ng/L) and lacked 
casein or proline. The tissue bombarded with the selectable marker, pDETRIC, 
was transferred to N6-0.5 medium that contained 2 mg/L glufosinate and lacked 
casein or proline. The tissue continued to grow slowly on these selective media. 
After an additional 2 weeks the tissue was transferred to fresh N6-0.5 medium 
10 containing the selective agents. 

Chlorsulfuron- and glufosinate-resistance callus clones could be identified 
after an additional 6-8 weeks. These clones continued to grow when transferred to 

r~ the selective media. 

CI 

fi The presence of pBT586 in the transformed clones has been confirmed by 

PJ 15 PCH analysis. Functionality of the introduced AK enzyme was tested by plating 

out transformed clones on N6-0.5 media containing 2 mM each of lysine and 
yl threonine (LT selection; see Example 13). All of the clones were capable of 

growing on LT medium indicating that the E. coli aspartate kinase was expressed 
h& and was functioning properly. To test that the E. coli DHDPS enzyme was 

£ 20 functional, transformed callus was plated on N6-0.5 media containing 2pm 
L i 2-aminoethylcysteine (AEC), a lysine analog and potent inhibitor of plant DHDPS. 

The transformed callus tissue was resistant to AEC indicating that the introduced 
DHDPS, which is about 16-fold less sensitive to AEC than the plant enzyme, was 
being produced and was functional. Plants have been regenerated from several 
25 transformed clones and are being grown to maturity. 

EXAMPLE 19 

Transformation of Soybean with the Phaseolin Promoter/cts/cordapA and 

Phaseolin Promoter/cts/lvsC-M4 Chimeric Genes 
The chimeric gene cassettes, phaseolin 5* region/ cts/co rdapA/ phaseolin 3' 
30 region plus phaseolin 5' region/cts /lysC -M4/phaseolin 3\ (Example 6) were 

inserted into the soybean transformation vector pBT603 (Figure 8A). This vector 
has a soybean transformation marker gene consisting of the 35S promoter from 
Cauliflower Mosaic Virus driving expression of the E. coli (J-glucuronidase gene 
[Jefferson et al. (1986) Proc. Natl. Acad. Sci. USA 55:8447-8451] with the Nos 3* 
35 region in a modified pGEM9Z plasmid. 

To insert the phaseolin 5' region/cts/ly£-M4/ phaseolin 3' region, the gene 
cassette was isolated as a 3.3 kb Hind in fragment and inserted into Hind HI 
digested pBT603, yielding plasmid pBT609. This binary vector has the chimeric 
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gene, phaseolin 5' region/ cts/ly§Q-M4/phaseolin 3' region inserted in the opposite 
orientation from the 35S/GUS/Nos 3' marker gene. 

To insert the phaseolin 5' region/cts/cordagA/ phaseolin 3'region 3', the gene 
cassette was isolated as a 2.7 kb BamH I fragment (as described in Example 15) 
and inserted into BamH I digested pBT609, yielding plasmid pBT614 (Figure 8B). 
This vector has both chimeric genes, phaseolin 5' regioiVcts/ly£-M4yfchaseolin 3' 
region and phaseolin 5' region/cts/co rdapA/ phaseolin 3' region inserted in the same 
orientation, and both are in the opposite orientation from the 35S/GUS/Nos 3' 
marker gene. 

Soybean was transformed with plasmid pBT614 according to the procedure 
described in United States Patent No. 5,015,580. Soybean transformation was 
performed by Agracetus Company (Middleton, WI). Seeds from five transformed 
lines were obtained and analyzed. 

It was expected that the transgenes would be segregating in the Rl seeds of 
the transformed plants. To identify seeds that carried the transformation marker 
gene, a small chip of the seed was cut off with a razor and put into a well in a a 
disposable plastic microtiter plate. A GUS assay mix consisting of 100 mM 
NaH 2 P0 4 , 10 mM EDTA, 0.5 mM K4Fe(CN) 6 , 0.1% Triton X-100, 0.5 mgAnL 
5-Bromo-4-chloro-3-indolyl (3-D-glucuronic acid was prepared and 0.15 mL was 
added to each microtiter well. The microtiter plate was incubated at 37° for 
45 min. The development of blue color indicated the expression of GUS in the 
seed. 

Five of seven transformed lines showed approximately 3:1 segregation for 
GUS expression indicating that the GUS gene was inserted at a single site in the 
soybean genome. The other transform ants showed 9:1 and 15:1 segregation, 
suggesting that the GUS gene was inserted at two sites. 

A meal was prepared from a fragment of individual seeds by grinding into a 
fine powder. Total proteins were extracted from the meal by adding 1 mg to 
0.1 mL of 43 mM Tris-HCl pH 6.8, 1.7% SDS, 4.2% 0-mercaptoethanol, 8% 
glycerol, vortexing the suspension, boiling for 2-3 min and vortexing again. The 
resultant suspensions were centrifuged for 5 min at room temperature in a 
microfiige to remove particulates and 10 \iL from each extract were run per lane 
on an SDS poly aery 1 amide gel, with bacterially produced DHDPS or AKDI serving 
as a size standard. The proteins were then electrophoretically blotted onto a 
nitrocellulose membrane. The membranes were exposed to the DHDPS or AKJH 
antibodies, at a 1:5000 or 1:1000 dilution, respectively, of the rabbit serum using 
standard protocol provided by BioRad with their Immun-Blot Kit. Following 
rinsing to remove unbound primary antibody the membranes were exposed to the 
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secondary antibody, donkey anti-rabbit Ig conjugated to horseradish peroxidase 
(Amersham) at a 1:3000 dilution. Following rinsing to remove unbound secondary 
antibody, the membranes were exposed to Amersham chemiluminescence reagent 
and X-ray film. 

Six of seven transformants expressed the DHDPS protein. In the six 
transfoimants that expressed DHDPS, there was excellent correlation between 
expression of GUS and DHDPS in individual seeds (Table 13). Therefore, the 
GUS and DHDPS genes are integrated at the same site in the soybean genome. 
Four of seven transformants expressed the AKIII protein, and again there was 
excellent correlation between expression of AKHI, GUS and DHDPS in individual 
seeds (Table 13). Thus, in these four transformants the GUS, AKIII and DHDPS 
genes are integrated at the same site in the soybean genome. One transform ant 
expressed only GUS in its seeds. 

To measure free amino acid composition of the seeds, free amino acids were 
extracted from 8-10 milligrams of the meal in 1.0 mL of methanol/chloro- 
form/water mixed in ratio of 12v/5v/3v (MCW) at room temperature. The mixture 
was vortexed and then centrifuged in an eppendorf microcentrifuge for about 
3 min; approximately 0.8 mL of supernatant was decanted. To this supernatant, 
0.2 mL of chloroform was added followed by 0.3 mL of water. The mixture was 
vortexed and then centrifuged in an eppendorf microcentrifuge for about 3 min, the 
upper aqueous phase, approximately 1.0 mL, was removed, and was dried down in 
a Savant Speed Vac Concentrator. The samples were hydrolyzed in 6N 
hydrochloric acid, 0.4% {i-mercaptoethanol under nitrogen for 24 h at 1 10-120°C; 
1/10 of the sample was run on a Beckman Model 6300 amino acid analyzer using 
post-column ninhydrin detection. Relative free amino acid levels in the seeds were 
compared as ratios of lysine to leucine, thus using leucine as an internal standard. 

Soybean transformants expressing Corynebacteria DHDPS alone and in 
concert with E. coli AKHI-M4 accumulated high levels of free lysine in their seeds. 
From 20 fold to 120-fold increases in free lysine levels were observed (Table 13). 
A high level of saccharopine, indicative of lysine catabolism, was also observed in 
seeds that contained high levels of lysine. Thus, prevention of lysine catabolism by 
inactivation of lysine ketoglutarate reductase should further increase the 
accumulation of free lysine in the seeds. Alternatively, incorporation of lysine into 
a peptide or lysine-rich protein would prevent catabolism and lead to an increase in 
the accumulation of lysine in the seeds. 

To measure the total amino acid composition of mature seeds, 1-1.4 
milligrams of the seed meal was hydrolyzed in 6N hydrochloric acid, 0.4% 
p-mercaptoethanol under nitrogen for 24 h at 1 10-120°C; 1/50 of the sample was 
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run on a Beckman Model 6300 amino acid analyzer using post-column ninhydrin 
detection. Lysine (and other amino acid) levels in the seeds were compared as 
percentages of the total amino acids. 

The soybean seeds expressing Cory neb acteria DHDPS showed substantial 
increases in accumulation of total seed lysine. Seeds with a 5-35% increase in total 
lysine content, compared to the untransformed control, were observed. In these 
seeds lysine makes up 7.5-7.7% of the total seed amino acids. 

Soybean seeds expressing Corynebacteria DHDPS in concert with E. coli 
AKHI-M4 showed much greater accumulation of total seed lysine than those 
expressing Corynebacteria DHDPS alone. Seeds with a more than four-fold 
increase in total lysine content were observed. In these seeds lysine makes up 
20-25% of the total seed amino acids, considerably higher than any previously 
known soybean seed. 



LINE-SEED GUS 
A2396-145-4 
A2396- 145-8 
A2396-145-5 
A2396-145-3 

A2396-145-9 + 
A2396-145-6 ' + 
A2396-145-1 + 
A2396-145-10 + 
A2396- 145-7 + 
A2396-145-2 + 



TABLE 13 

Free LYS/LEU 

0.9 

1.0 

0.8 

1.0 

2.0 

4.6 

8.7 
18.4 
21.7 
45.5 



DHDPS 



AKDl 



+ 
+ 



% TOTAL 
SEEDLYS 

5.8 



5.9 



7.5 
6.7 
7.2 



A5403-175-9 




1.3 






A5403-175-4 




1.2 




6.0 


A5403- 175-3 




1.0 




6.0 


A5403- 175-7 


+ 


1.5 






A5403- 175-5 


+ 


1.8 






A5403-175-1 


+ 


6.2 






A5403-175-2 


+ 


6.5 




6.3 


A5403-175-6 


+ 


14.4 






A5403- 175-8 


+ 


47.8 


+ 


7.7 


A5403- 175-10 


+ 


124.3 


+ 


7.5 



90 



A5403-181-9 + 1.4 

A5403-181-10 + 1.4 - 5.7 

A5403-181-8 + 0.9 

A5403-181-6 + 1.5 

A5403-181-4 - 0.7 - - 5.9 

A5403-181-5 + 1.1 

A5403-181-2 - 1.8 - 5.6 

A5403-181-3 + 2.7 - 5.5 

A5403-181-7 + 1.9 

A5403-181-1 - 2.3 

A5403-183-9 - 0.8 

A5403-183-6 - 0.7 - 6.0 

o 

□ A5403-183-8 - 1.3 

W A5403-183-4 - 1.3 ■ - - 6.0 



p A5403-183-5 + 0.9 

A5403-183-3 + 3.1 

A5403-183-1 + 3.3 



01 



P A5403-183-7 + 9.9 

m 

H A5403-183-10 + 22.3 + + 6.7 

pa 

SI A5403-183-2 + 23.1 + + 7.3 

A5403-196-8 ' 0.9 - 5.9 

A5403-196-6 + 8.3 

A5403-196-1 + 16.1 + + 6.8 

A5403-196-7 + 27.9 

A5403-196-3 + 528 

A5403-196-5 + 26 

A5403-196-2 + 16.2 + + 

A5403-196-10 + 29 + + 7.5 

A5403-196-4 + 58.2 + + 7.6 

A5403-196-9 + 47.1 

A2396-233-1 + + + 25 

A2396-233-2 + 18 

A2396-233-3 + 23 

A2396-233-4 + 20 

A2396-233-5 - +/- - 6.0 

A2396-233-6 + 16 
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A2396-233-13 + + + 18 

A2396-234-1 + + + 8.3 

A2396-234-2 + + + 13 

A2396-234-3 + 10 

A2396-234-4 + 19 

A2396-234-9 + 15 

A2396-234-16 - - - 5.9 

wildtype - 0.9 - 5.6 
control 



EXAMPLE 20 
Isolation of a Plant 
Lysine Ketoglutarate R eductase Gene 

Lysine Ketoglutarate Reductase (LKR) enzyme activity has been observed in 
immature endosperm of developing maize seeds [Arruda et al. (1982) Plant 
Physiol. 69:988-989]. LKR activity increases sharply from the onset of endosperm 
development, reaches a peak level at about 20 d after pollination, and then declines 
[Arruda et al. (1983) Phytochemistry 22:2687-2689]. 

In order to clone the com LKR gene, RNA was isolated from developing 
seeds 19 days after pollination. This RNA was sent to Clontech Laboratories, Inc., 
(Palo Alto, CA) for the custom synthesis of a cDNA library in the vector Lambda 
Zap II. The conversion of the Lambda Zap II library into a phagemid library, then 
into a plasmid library was accomplished following the protocol provided by 
Clontech. Once converted into a plasmid library the ampicillin-resistant clones 
obtained carry the cDNA insert in the vector pBluescript SK(-). Expression of the 
cDNA is under control of the lacZ promoter on the vector. 

Two phagemid libraries were generated using die mixtures of the Lambda 
Zap H phage and the filamentous helper phage of 100 pL to 1 pX. Two additional 
libraries were generated using mixtures of 100 pL Lambda Zap H to 10 pL helper 
phage and 20 pL Lambda Zap II to 10 pL helper phage. The titers of the 
phagemid preparations were similar regardless of the mixture used and were about 
2 x 10 3 ampicillin-resistant-transfectants per mL with E. coli strain XL 1 -Blue as 
the host and about 1 x 10 3 with DEI 26 (see below) as host. 

To select clones that carried the LKR gene a specially designed E. coli host, 
DE126 was constructed. Construction of DE126 occurred in several stages. (1) A 
generalized transducing stock of coliphage Plvir was produced by infection of a 
culture of TST1 [F~» araD139, delta(argF-lac.)205, flb_5301, ptsF25, reJAl, 
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rpsL150. malE52::TnlO, deoC l. X'] (E. coli Genetic Stock Center #6137) using a 
standard method (for Methods see J. Miller, Experiments in Molecular Genetics). 

(2) This phage stock was used as a donor in a transductional cross (for 
Method see J. Miller, Experiments in Molecular Genetics) with strain GIF106M1 
[F\ arg-, flvA296, lysClOOl. thrAl 101, metLlOOO, >„-• ip§L9, malT l. xyi-7, mtl-2, 
thil(?), supE 44(?)1 (E. £oli Genetic Stock Center #5074) as the recipient. 
Recombinants were selected on rich medium [L supplemented with DAP] 
containing the antibiotic tetracycline. The transposon TnlO, conferring tetracycline 
resistance, is inserted in the malE gene of strain TST1. Tetracycline-resistant 
transductants derived from this cross are likely to contain up to 2 min of the E. coli 
chromosome in the vicinity of malE. The genes malE and lysC are separated by 
less than 0.5 minutes, well within cotransduction distance. 

(3) 200 tetracycline-resistant transductants were thoroughly phenotyped; 
appropriate fermentation and nutritional traits were scored. The recipient strain 
GIF106M1 is completely devoid of aspartokinase isozymes due to mutations in 
thrA, metL and lysC . and therefore requires the presence of threonine, methionine, 
lysine and meso-diaminopimelic acid (DAP) for growth. Transductants that had 
inherited lysC+ with malE::Tnl0 from TST1 would be expected to grow on a 
minimal medium that contains vitamin B 1, L-arginine, L-isoleucine and L-valine in 
addition to glucose which serves as a carbon and energy source. Moreover strains 
having the genetic constitution of lysC*. metL - and thrA - will only express the 
lysine sensitive aspartokinase. Hence addition of lysine to the minimal medium 
should prevent the growth of the lysC * recombinant by leading to starvation for 
threonine, methionine and DAP. Of the 200 tetracycline resistant transductants 
examined, 49 grew on the minimal medium devoid of threonine, methionine and 
DAP. Moreover, all 49 were inhibited by the addition of L-lysine to the minimal 
medium. One of these transductants was designated DE125. DE125 has the 
phenotype of tetracycline resistance, growth requirements for arginine, isoleucine 
and valine, and sensitivity to lysine. The genotype of this strain is F" malE52::TnlO 
arg- flvA296 thrA l 101 metL lOOO lambda- rpsL9 malT l xyl-7 mtl-2 thil(?) 
supE4 4(?). 

(4) This step involves production of a male derivative of strain DE125. 
Strain DE125 was mated with the male strain AB1528 [F16/delta(gpt-BiQA)62, 
lacYl or lacZ4, glnV44. galK 2 rac'(?), hisG 4. rfbdl, mgl-51, kdgK51(?), flv£7, 
ar gE 3. thi-1] (E. coli Genetic Stock Center #1528) by the method of conjugation. 
F16 carries the ilvGMEDAYC gene cluster. The two strains were cross streaked 
on rich medium permissive for the growth of each strain. After incubation, the 
plate was replica plated to a synthetic medium containing tetracycline, arginine, 
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vitamin B 1 and glucose. DE125 cannot grow on this medium because it cannot 
synthesize isoleucine. Growth of AB 1528 is prevented by the inclusion of the 
antibiotic tetracycline and the omission of proline and histidine from the synthetic 
medium. A patch of cells grew on this selective medium. These recombinant cells 
underwent single colony isolation on the same medium. The phenotype of one 
clone was determined to be Dv+» Arg", TetR, Lysine-sensitive, male specific phage 
(MS2)-sensitive, consistent with the simple transfer of F16 from AB1528 to 
DEI 25. This clone was designated DEI 26 and has the genotype 
F16/maffi52::TnlO, arg-, ilvA296. thrAllOl. metL lOO. lvsC+ . Af, rpsL9. malTl. 
xyl-7, mtl-2, thi-1?. supE44?. It is inhibited by 20 |Xg/mL of L-lysine in a synthetic 
medium. 

To select for clones from the corn cDNA library that carried the LKR gene, 
100 \iL of the phagemid library was mixed with 100 |iL of an overnight culture of 
DEI 26 grown in L broth and the cells were plated on synthetic media containing 
vitamin Bl, L-arginine, glucose as a carbon and energy source, 100 \lg/mL 
ampicillin and L-lysine at 20, 30 or 40 ^tg/mL. Four plates at each of the three 
different lysine concentrations were prepared. The amount of phagemid and 
DEI 26 cells was expected to yield about 1 x 10 5 ampicillin-resistant transfectants 
per plate. Ten to thirty lysine-resistant colonies grew per plate (about 1 lysine- 
resistant per 5000 ampicillin-resistant colonies). 

Plasmid DNA was isolated from 10 independent clones and retransfoimcd 
into DE126. Seven of the ten DNAs yielded lysine-resistant clones demonstrating 
that the lysine-resistance trait was carried on the plasmid Several of the cloned 
DNAs were sequenced and biochemically characterized. The inserted DNA 
fragments were found to be derived from the E. coli genome, rather than a com 
cDNA indicating that the cDNA library provided by Clontech was contaminated. 

Another method was used to identify plant cDNAs that encode LKR. This 
method was based upon expected homology between plant LKR and fungal genes 
encoding saccharopine dehydrogenase. Fungal saccharopine dehydrogenase 
(glutamate-forming) and saccharopine dehydrogenase (lysine-forming) catalyze the 
final two steps in the fungal lysine biosynthetic pathway. Plant LKR and fungal 
saccharopine dehydrogenase (lysine-forming) catalyze both forward and reverse 
reactions, use identical substrates and use similar co-factors. Similarly, plant 
saccharopine dehydrogenase (glutamate-forming), which catalyzes the second step 
in the lysine catabolic pathway, works in both forward and reverse reactions, uses 
identical substrates and uses similar co-factors as fungal saccharopine 
dehydrogenase (glutamate-forming). 
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Biochemical and genetic evidence derived from human and bovine studies 
has demonstrated that mammalian LKR and saccharopine dehydrogenase 
(glutamate-forming) enzyme activities are present on a single protein with a 
monomer molecular weight of about 1 17,000. This contrasts with the fungal 
5 enzymes which are carried on separate proteins, saccharopine dehydrogenase 
(lysine-forming) with a molecular weight of about 44,000 and saccharopine 
dehydrogenase (glutamate-forming) with a molecular weight of about 51,000. 
Plant LKR has been reported to have a molecular weight of about 140,000 
indicating that it is like the animal catabolic protein wherein both LKR and 
10 saccharopine dehydrogenase (glutamate-forming) enzyme activities are present on 
a single protein. 

Several genes for fungal saccharopine dehydrogenases have been isolated and 
sequenced [Xuan et al. (1990) Mol. Cell. Biol. 10:4795^806, Feller et al. (1994) 
MoL Cell. BioL 14:641 1-6418]. The fungal protein sequences, deduced from 

15 these gene sequences, were used to search plant cDNA databases for DNA 
fragments that encoded plant proteins homologous to the fungal saccharopine 
dehydrogenases. We discovered two plant cDNA fragments from Arabidopsis 
thaliana, SEQ ID NO: 102; and SEQ ID NO:103:, that encoded polypeptides SEQ 
ID NO: 104: and SEQ ID NO:105:, respectively, that are homologous to fungal 

20 saccharopine dehydrogenase (glutamate-forming). The sequence similarity 

between the fungal and plant polypeptides (see Figure 9) demonstrate that these 
cDNAs encode Arabidopsis saccharopine dehydrogenase. Full length cDNAs 
encoding plant LKR plus saccharopine dehydrogenase or genomic DNAs 
containing the entire LKR/saccharqpine dehydrogenase gene can be readily 

25 identified by hybridization to labelled cDNA fragments of SEQ ID NO: 102: or 
SEQ ED NO: 103: and thus isolated. 

In order to block expression of the LKR gene in transformed plants, a 
chimeric gene designed for cosuppression of LKR can be constructed by linking 
the LKR gene or gene fragment to any of the plant promoter sequences described 

30 above. (See U.S. Patent No. 5,23 1 ,020 for methodology to block plant gene 

expression via cosuppression.) Alternatively, a chimeric gene designed to express 
antisense RNA for all or part of the LKR gene can be constructed by linking the 
LKR gene or gene fragment in reverse orientation to any of the plant promoter 
sequences described above. (See U.S. patent 5,107,065 for methodology to block 

35 plant gene expression via antisense RNA.) Either the cosuppression or antisense 
chimeric gene could be introduced into plants via transformation. Transform ants 
wherein expression of the endogenous LKR gene is reduced or eliminated are 
selected. 
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EXAMPLE 21 

Construction of Synthetic Genes in Expression Vector pSK5 
To facilitate the construction and expression of the synthetic genes described 
below, it was necessary to construct a plasmid vector with the following attributes: 

1 . No Ear I restriction endonuclease sites such that insertion of 
sequences would produce a unique site. 

2. Containing a tetracycline resistance gene to avoid loss of plasmid 
during growth and expression of toxic proteins. 

3. Containing approximately 290 bp from plasmid pBT430 including 
the T7 promoter and tenninator seqment for expression of inserted sequences in 
E. coli. 

4. Containing unique EcoR I and Nco I restriction endonuclease 
recognition sites in proper location behind the T7 promoter to allow insertion of 
the oligonucleotide sequences. 

To obtain attributes 1 and 2 Applicants used plasmid pSKl which was a 
spontaneous mutant of pBR322 where the ampicillin gene and the Ear I site near 
that gene had been deleted Plasmid pSKl retained the tetracycline resistance 
gene, the unique EcoR I restriction sites at base 1 and a single Ear I site at base 
2353. To remove the Ear I site at base 2353 of pSKl a polymerase chain reaction 
(PCR) was performed using pSKl as the template. Approximately 10 femtomoles 
of pSKl were mixed with 1 |xg each of oligonucleotides SM70 and SM71 which 
had been synthesized on an ABI1306B DNA synthesizer using the manufacturer's 
procedures. 

SM70 S'-CTGACTCGCTGCGCTCGGTC 3' SEQ ID NO: 16 

SM71 S'-TATTTTCTCCTTACGCATCTGTGC-S' SEQ ID NO: 17 

The priming sites of these oligonucleotides on the pSKl template are 
depicted in Figure 10. The PCR was performed using a Perkin-Elmer Cetus kit 
(Emeryville, CA) according to the instructions of the vendor on a thermocycler 
manufactured by the same company. The 25 cycles were 1 min at 95°, 2 min at 
42° and 12 min at 72°. The oligonucleotides were designed to prime replication of 
the entire pSKl plasmid excluding a 30 b fragment around the Ear I site (see 
Figure 10). Ten microliters of the 100 julL reaction product were run on a 1% 
agarose gel and stained with ethidium bromide to reveal a band of about 3.0 kb 
corresponding to the predicted size of the replicated plasmid. 

The remainder of the PCR reaction mix (90 \lL) was mixed with 20 JxL of 
2.5 mM deoxynucleotide triphosphates (dATP, dTTP, dGTP, and dCTP), 30 units 
of Klenow enzyme added and the mixture incubated at 37° for 30 min followed by 
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65° for 10 min. The Klenow enzyme was used to fill in ragged ends generated by 
the PCR. The DNA was ethanol precipitated, washed with 70% ethanol, dried 
under vacuum and resuspended in water. The DNA was then treated with T4 
DNA kinase in the presence of 1 mM ATP in kinase buffer. This mixture was 
incubated for 30 min at 37° followed by 10 min at 65°. To 10 \iL of the kinased 
preparation, 2 p,L of 5X ligation buffer and 10 units of T4 DNA ligase were added. 
The ligation was carried out at 15° for 16 h. Following ligation, the DNA was 
divided in half and one half digested with Ear I enzyme. The Klenow, kinase, 
ligation and restriction endonuclease reactions were performed as described in 
Sambrook et al., [Molecular Cloning, A Laboratory Manual, 2nd ed. (1989) Cold 
Spring Harbor Laboratory Press]. Klenow, kinase, ligase and most restriction 
endonucleases were purchased from BRL. Some restriction endonucleases were 
purchased from NEN Biolabs (Beverly, MA) or Boehringer Mannheim 
(Indianapolis, IN). Both the ligated DNA samples were transformed separately 
into competent JM103 [supE thi del (lac-proAB) F [traD36 porAB, laclq lacZ del 
M15] restriction minus] cells using the CaCl2 method as described in Sambrook et 
al., [Molecular Cloning, A Laboratory Manual, 2nd ed. (1989) Cold Spring 
Harbor Laboratory Press] and plated onto media containing 12.5 ug/mL 
tetracycline. With or without Ear I digestion the same number of transformants 
were recovered suggesting that the Ear I site had been removed from these 
constructs. Clones were screened by preparing DNA by the alkaline lysis miniprep 
procedure as described in Sambrook et al., [Molecular Cloning, A Laboratory 
Manual, 2nd ed. (1989) Cold Spring Harbor Laboratory Press] followed by 
restriction endonuclease digest analysis. A single clone was chosen which was 
tetracycline-resistant and did not contain any Ear I sites. This vector was 
designated pSK2. The remaining EcoR I site of pSK2 was destroyed by digesting 
the plasmid with EcoR I to completion, filling in the ends with Klenow and 
ligating. A clone which did not contain an EcoR I site was designated pSK3. 

To obtain attributes 3 and 4 above, the bacteriophage T7 RNA polymerase 
promoterAerminator segment from plasmid pBT430 (see Example 2) was amplified 
by PCR. Oligonucleotide primers SM78 (SEQ ID NO: 18) and SM79 (SEQ ID 
NO: 19) were designed to prime a 300b fragment from pBT430 spanning the T7 
promoter/terminator sequences (see Figure 10). 

SM78 5^TTCATCGATAGGCGACCACACCCGTCC-3' SEQ ID NO: 18 
SM79 5 , -AATATCGATGCCACGATGCGTCCGGCG-3 f SEQ ID NO: 19 
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The PCR reaction was carried out as described previously using pBT430 as 
the template and a 300 bp fragment was generated. The ends of the fragment were 
filled in using Klenow enzyme and kinased as described above. DNA from plasmid 
pSK3 was digested to completion with PvuII enzyme and then treated with calf 
intestinal alkaline phophatase (Boehringer Mannheim) to remove the 5' phosphate. 
The procedure was as described in Sambrook et al., [Molecular Cloning, A 
Laboratory Manual, 2nd ed. (1989) Cold Spring Harbor Laboratory Press]. The 
cut and phosphatased pSK3 DNA was purified by ethanol precipitation and a 
portion used in a ligation reaction with the PCR generated fragment containing the 
T7 promoter sequence. The ligation mix was transformed into JM103 [supE thi 
del (lac-proAB) F [traD36 porAB, laclq lacZ del Ml 5] restriction minus] and 
tetracycline-resistant colonies were screened. Plasmid DNA was prepared via the 
alkaline lysis mini prep method and restriction endonuclease analysis was 
performed to detect insertion and orientation of the PCR product. Two clones 
were chosen for sequence analysis: Plasmid pSK5 had the fragment in the 
orientation shown in Figure 10. Sequence analysis performed on alkaline 
denatured double-stranded DNA using Sequenase® T7 DNA polymerase (US 
Biochemical Coip) and manufacturer's suggested protocol revealed that pSK5 had 
no PCR replication errors within the T7 promoter/terminator sequence. 

The strategy for the construction of repeated synthetic gene sequences based 
on the Ear I site is depicted in Figure 1 1 . The first step was the insertion of an 
oligonucleotide sequence encoding a base gene of 14 amino acids. This 
oligonucleotide insert contained a unique Ear I restriction site for subsequent 
insertion of oligonucleotides encoding one or more heptad repeats and added an 
unique Asp 718 restriction site for use in transfer of gene sequences to plant 
vectors. The overhanging ends of the oligonucleotide set allowed insertion into the 
unique Nco I and EcoR I sites of vector pSK5. 



MEEKMKAMEEK 



SM81 



5 1 -G^TGGAGGAGAAGATGAAGGCGATGC^AGAGAAG 



SM80 



3 1 -£TCCTCTTCTACTTCCGCTAC£XT£XCTTC 



NCO I 



EAR I 



M K A 



(SEQ ID 



NO:22) 



SM81 



ATGAAGGCGTGAT AGGTACCG - 3 « 



(SEQ ID 



NO:20) 



SM80 



T A CTTflf!Gfl Af!T AT CC1AT GGCTT AA— 5 • 



(SEQ ID 



NO:21) 



ASP718 ECOR I 
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DNA from plasmid pSK5 was digested to completion with Nco I and EcoR I 
restriction endonucleases and purified by agarose gel electrophoresis. Purified 
DNA (0.1 ug) was mixed with 1 \ig of each oligonucleotide SM80 (SEQ ID 
NO: 14) and SM81 (SEQ ID NO:13) and ligated. The ligation mixture was 
transformed into E. coli strain JM103 [supE thi del (lac-proAB) F [traD36 porAB, 
laclq lacZ del Ml 5] restriction minus] and tetracycline resistant transform ants 
screened by rapid plasmid DNA preps followed by restriction digest analysis. A 
clone was chosen which had one each of Ear I, Nco I, Asp 718 and EcoR I sites 
indicating proper insertion of the oligonucleotides. This clone was designated 
pSK6 (Figure 12). Sequencing of the region of DNA following the T7 promoter 
confirmed insertion of oligonucleotides of the expected sequence. 

Repetitive heptad coding sequences were added to the base gene construct of 
described above by generating oligonucleotide pairs which could be directly ligated 
into the unique Ear I site of the base gene. Oligonucleotides SM84 (SEQ ID 
NO:23) and SM85 (SEQ ID NO:24) code for repeats of the SSP5 heptad. 
Oligonucleotides SM82 (SEQ ID NO:25) and SM83 (SEQ ID NO:26) code for 
repeats of the SSP7 heptad. 



SSP5 




M E E K M K A 


(SEQ 


ID 


NO:28) 


SM84 


5 


1 -GATGGAGGAGAAGATGAAGGC- 3 ' 


(SEQ 


ID 


NO:23) 


SM85 


3 


1 - CCTCCTCTTCTACTTCCGCTA-5 1 


(SEQ 


ID 


NO:24) 


SSP7 




M E E K L K A 


(SEQ 


ID 


NO:27) 


SM82 


5 1 - 


- GATGGAGGAGAAGCTGAAGGC- 3 1 


(SEQ 


ID 


NO:25) 


SM83 


3'- 


CCTCCTCTTCGACTTCCGCTA-5 » 


(SEQ 


ID 


NO;26) 



Oligonucleotide sets were ligated and purified to obtain DNA fragments 
encoding multiple heptad repeats for insertion into the expression vector. 
Oligonucleo-tides from each set totalling about 2 [ig were kinased, and ligated for 
2 h at room temperature. The ligated multimers of the oligonucleotide sets were 
separated on an 18% non-denaturing 20 X 20 X 0.015 cm polyacrylamide gel 
(Acrylamide: bis-acrylamide = 19:1). Multimeric forms which separated on the 
gel as 168 bp (8n) or larger were purified by cutting a small piece of 
polyacrylamide containing the band into fine pieces, adding 1.0 mL of 0.5 M 
ammonium acetate, 1 mM EDTA (pH 7,5) and rotating the tube at 37° overnight. 
The polyacrylamide was spun down by centrifugation, 1 jig of tRNA was added to 
the supernatant,the DNA fragments were precipitated with 2 volumes of ethanol at 
-70°, washed with 70% ethanol, dried, and resuspended in 10 |iL of water. 
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Ten micrograms of pSK6 DNA were digested to completion with Ear I 
enzyme and treated with calf intestinal alkaline phosphatase. The cut and 
phosphatased vector DNA was isolated following electrophoresis in a low melting 
point agarose gel by cutting out the banded DNA, liquifying the agarose at 55°, 
and purifying over NACS PREPAC_ columns (BRL) following manufacturer's 
suggested procedures. Approximately 0.1 |ig of purified Ear I digested and 
phosphatase treated pSK6 DNA was mixed with 5 |XL of the gel purified 
multimeric oligonucleotide sets and ligated. The ligated mixture was transformed 
into E. coli strain JM103 [supE thi del (lac-proAB) F [traD36 porAB, laclq lacZ 
del Ml 5] restriction minus] and tetracycline-resistant colonies selected. Clones 
were screened by restriction digests of rapid plasmid prep DNA to determine the 
length of the inserted DNA. Restriction endonuclease analyses were usually 
carried out by digesting the plasmid DNAs with Asp 718 and Bgl II, followed by 
separation of fragments on 18% non-denaturing polyacrylamide gels. Visualization 
of fragments with ethidium bromide, showed that a 150 bp fragment was generated 
when only the base gene segment was present. Inserts of the oligonucleotide 
fragments increased this size by multiples of 21 bases. From this screening several 
clones were chosen for DNA sequence analysis and expression of coded sequences 
in E. coli. 

Table 14 

Sequence by Heptad 
qpn^# SEQ ID NQ; Amino Acid Rep eat ($SP) $E Q XD NO : 



C15 29 5. 7.7.7.7.7 .5 30 

C20 31 5. 7.7.7.7.7 .5 32 

C30 33 5 .7.7.7.7 .5 34 

D16 35 5.5,5.5 36 

D20 37 5.5.5.5.5 38 

D33 39 5.5,5.5 40 



The first and last SSP5 heptads flanking the sequence of each construct are from 
the base gene described above. Inserts are designated by underlining. 

Because the gel purification of the oligomeric forms of the oligonucleotides 
did not give the expected enrichment of longer (i.e., >8n) inserts, Applicants used a 
different procedure for a subsequent round of insertion constructions. For this 
series of constructs four more sets of oligonucleotides were generated which code 
for SSP 8,9,10 and 1 1 amino acid sequences respectively: 
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SSPB 




M E E K T. K K 


# 


(SEQ 


ID 


NO:49) 


SM86 


5 « 


-GATGGAGGAGAAC;rTGAAGAA-3 1 




(SEQ 


ID 


NO: 41) 


SMS 7 


3» 


CCTCCTCTTCGACTTCTTCTA 


-5' 


(SEQ 


ID 


N0:42) 


SSP9 




M E E K L K W 




(SEQ 


ID 


NO: 50) 


SM88 


5 • 


-GATGGAGGAGAAGCTGAAGTG- 3 ' 




(SEQ 


ID 


NO: 43) 


SM89 


3' 


CCTCCTCTTCGACTTCACCTA 


-5' 


(SEQ 


ID 


NO:44) 


SSP10 




M E E K M K K 




(SEQ 


ID 


NO: 51) 


SM90 


5 ■ 


-GATGGAGGAGAAGATGAAGAA-3 1 




(SEQ- 


ID 


NO:45) 


SM91 


3' 


CCTCCTCTTCTACTTCTTCTA 


-5' 


<SEQ 


ID 


NO:46) 


SSP11 




M E E K M K W 

& A Jt-J AX A A AX ¥W 




(SEQ 


ID 


NO: 52) 


SM92 


5' 


-GATGGAGGAGAAGATGAAGTG-3 ' 




(SEQ 


ID 


NO:47) 


SM93 




- CCTCCTCTTCTACTTCACCTA 


-5 1 


(SEQ 


ID 


NO : 4 0 ) 



The following HPLC procedure was used to purify multimeric forms of the 
oligonucleotide sets after kinasing and ligating the oligonucleotides as described 
above. Chromatography was performed on a Hewlett Packard Liquid 
Chromatograph instrument, Model 1090M. Effluent absorbance was monitored at 
260 run. Ligated oligonucleotides were centrifuged at 12,000xg for 5 min and 
injected onto a 2.5 \i TSK DEAE-NPR ion exchange column (35 cm x 4.6 mm 
I.D.) fitted with a 0.5 \i in-line filter (Supelco). The oligonucleotides were 
separated on the basis of length using a gradient elution and a two buffer mobile 
phase [Buffer A: 25 mM Tris-Cl, pH 9.0, and Buffer B: Buffer A + 1 M NaCl]. 
Both Buffers A and B were passed through 0.2 \i filters before use. The following 
gradient program was used with a flow rate of 1 mL per min at 30°: 



Time %A %>B 

initial 75 25 

0.5 min 55 45 

5 min 50 50 

20 min 38 62 

23 min 0 100 

30 min 0 100 

31 min 75 25 
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Fractions (500 jiLJwere collected between 3 min and 9 min. Fractions 
corresponding to lengths between 120 bp and 2000 bp were pooled as determined 
from control separations of restriction digests of plasmid DNAs. 

The 4.5 raL of pooled fractions for each oligonucleotide set were 
5 precipitated by adding 10 jig of tRNA and 9.0 mL of ethanol, rinsed twice with 
70% ethanol and resuspended in 50 [ih of water. Ten \iL of the resuspended 
HPLC purified oligonucleotides were added to 0.1 [ig of the Ear I cut, 
phosphatased pSK6 DNA described above and ligated overnight at 15°. All six 
oligonucleotide sets described above which had been Jrinased and self-ligated but 

10 not purified by gel or HPLC were also used in separate ligation reactions with the 
pSK6 vector. The ligation mixtures were transformed into E. coli strain DH5a 
[supE44 del lacU169 (phi 80 lacZ del M15) hsdR17 recAl endAl gyrl96 thil 
relAl] and tetracycline-resistant colonies selected. Applicants chose to use the 
DH5a [supE44 del lacU169 (phi 80 lacZ del M15) hsdR17 rccAl endAl gyrl96 

15 thil relAl] strain for all subsequent work because this strain has a very high 

transformation rate and is recA- . The recA- phenotype eliminates concerns that 
these repetitive DNA structures may be substrates for homologous recombination 
leading to deletion of multimeric sequences. 

Clones were screened as described above. Several clones were chosen to 

20 represent insertions of each of the six oligonucleotide sets. 

Table 15 

Sequence by Hcptad 
Clone # SEQ ID NO: Amino Acid Repeat (SSP) SEP ID NO : 



82-4 53 7.7.7.7.7.7 .5 54 

84-H3 55 5.M.5 56 

86-H23 57 5.M-5 58 

88-2 59 5.9.9.9.5 60 

90-H8 61 5. 10.10.10 .5 62 

92-2 63 5. 11.11 .5 64 



The first and last SSP5 heptads flanking the sequence represent the base gene 
sequence. Insert sequences are underlined. Clone numbers including the letter "H" 

25 designate HPLC-purified oligonucleotides. The loss of the first base gene repeat in 
clone 82-4 may have resulted from homologous recombination between the base 
gene repeats 5.5 before the vector pSK6 was transferred to the rccA- strain. The 
HPLC procedure did not enhance insertion of longer multimeric forms of the oligo- 
nucleotide sets into the base gene but did serve as an efficient purification of the 

30 ligated oligonucleotides. 
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Oligonucleotides were designed which coded for mixtures of the SSP 
sequences and which varied codon usage as much as possible. This was done to 
reduce the possibility of deletion of repetitive inserts by recombination once the 
synthetic genes were transformed into plants and to extend the length of the 
constructed gene segments. These oligonucleotides encode four repeats of heptad 
coding units (28 amino acid residues) and can be inserted at the unique Ear I site in 
any of the previously constructed clones. SM96 and SM97 code for SSP(5>4, 
SM98 and SM99 code for SSP(7)4 and SM100 plus SM101 code for SSP8.9.8.9. 

MEEKMKAMEEKMK 
SM9 6 5 1 -GATGGAGGAAAAGATGAAGGCGATGGAGGAGAAAATGAAA 

SM97 3 1 CCTCCTTTTCTACTTCCGCTACCTCCTCTTTTACTTT 

AMEEKMKAME EKMKA (SEQ ID NO: 67) 

GCTATGGAGGAAAAGATGAAAGCGATGGAGGAGAAAATGAAGGC-3 ' (SEQ ID NO: 65) 

CGATACCTCCTTTTCTACTTTCGCTACCTCCTCTTTTACTTCCGCTA- 5 1 (SEQ ID NO: 66) 

MEEKLKAMEEKLK 
SM9 8 5 • -GATGGAGGAAAAGCTGAAAGCGATGGAGGAGAAACTCAAG 

SM9 9 3 f CCTCCTTTTCGACTTTCGCTACCTCCTCTTTGAGTTC 

AMEEKLKAMEEKLKA (SEQ ID NO: 70) 

GCTAtGGAAGAAAAGCTTAAAGCGATGGAGGAGAAACTGAAGGC-3 ' (SEQ ID NO: 68) 

CGATACCTTCTTTTCGAATTTCGCATCCTCCTCTTTGACTTCCGCTA-5' (SEQ ID NO: 69) 

MEEKLKKMEEKLK 
SMI 00 5 ' -GATGGAGGAAAAGCTTAAGAAGATGGAAGAAAAGCTGAAA 

SMI 01 3 1 CCTCCTTTTCGAATTCTTCTACCTTCTTTTCGACTTT 

WMEEKLK KMEEKLKW (SEQ ID NO: 73) 

TGGATGGAGGAGAAACTCAAAAAGATGGAGGAAAAGCTTAAATG-3' (SEQ ID NO: 71) 

ACCTACCTCCTCTTTGAGTTTTTCATCCTCCTTTTCGAATTTACCTA- 5 1 (SEQ ID NO: 72) 

DNA from clones 82-4 and 84-H3 were digested to completion with Ear I 
enzyme, treated with phosphatase and gel purified. About 0.2 fXg of this DNA 
were mixed with 1.0 |ig of each of the oligonucleotide sets SM96 and SM97, 
SM98 and SM99 or SM100 and SM101 which had been previously kinased. The 
DNA and oligonucleotides were ligated overnight and then the ligation mixes 
transformed intoE. coli strain DH5a. Tetracycline-resistant colonies were 
screened as described above for the presence of the oligonucleotide inserts. Clones 
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were chosen for sequence analysis based on their restriction endonuclease digestion 
patterns. 



Table 16 

Sequence by Heptad 
Clone # SEP ID NO: Amino Acid Repeat (SSP) SEP ID NO : 

2- 9 74 7.7.7.7.7.7. 8.9.8.9 .5 75 

3- 5 78 7.7.7.7.7.7.5.5 79 

5-1 76 5.5.5.7.7.7.7.5 77 

Inserted oligonucleotide segments are underlined 



Clone 2-9 was derived from oligonucleotides SM100 (SEQ ID NP:71) and 
SM101 (SEQ ID NP:72) ligated into the Ear I site of clone 82-4 (see above). 
Clone 3-5 (SEQ ID NP:78) was derived from the insertion of the first 22 bases of 
the oligonucleotide set SM96 (SEQ ID NP:65) and SM97 (SEQ ID NP:66) into 
the Ear I site of clone 82-4 (SEQ ID NP:53X This partial insertion may reflect 
improper annealing of these highly repetitive oligos. Clone 5-1 (SEQ ID NP:76) 
was derived from oligonucleotides SM98 (SEQ ID NP:68) and SM99 (SEQ ID 
NP:69) ligated into the Ear I site of clone 84-H3 (SEQ ID NP:55). 



Strategy II. 

A second strategy for construction of synthetic gene sequences was 
implemented to allow more flexibility in both DNA and amino acid sequence. This 
strategy is depicted in Figure 13 and Figure 14. The first step was the insertion of 
an oligonucleotide sequence encoding a base gene of 16 amino acids into the 
original vector pSK5. This oligonucleotide insert contained an unique Ear I site as 
in the previous base gene construct for use in subsequent insertion of 
oligonucleotides encoding one or more heptad repeats. The base gene also 
included a BspH I site at the 3* terminus. The overhanging ends of this cleavage 
site are designed to allow "in frame" protein fusions using Nco I overhanging ends. 
Therefore, gene segments can be multiplied using the duplication scheme described 
in Figure 14. The overhanging ends of the oligonucleotide set allowed insertion 
into the unique Nco I and EcoR I sites of vector pSK5. 



MBEKMKKLEEK 
SMI 07 5 1 -CATGGAGGAGAAGATGAAAAAGCTCGAAGAGAAG 

SMI 06 3 1 -CTCCTCTTCTACTTTTTCGAGCTTCTCTTC 

NCO I EAR I 
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M K V M K (SEQ ID NO: 82) 

ATGAAGGTCATGAAGTGATAGGTACCG-3 1 (SEQ ID NO: 80) 

TACTTCCAGTACTTCACTATCCATGGCTTAA-5 1 (SEQ ID NO: 81) 
BSPH I ASP 718 

The oligonucleotide set was inserted into pSK5 vector as described in Strategy I 
above. The resultant plasmid was designated pSK34. 

Oligonucleotide sets encoding 35 amino acid "segments" were ligated into 
the unique Ear I site of the pSK34 base gene using procedures as described above. 
In this case, the oligonucleotides were not gel or HPLC purified but simply 
annealed and used in the ligation reactions. The following oligonucleotide sets 
were used: 



SEG 3 LEEKMKAMEDKMKW 
SMI 10 5 1 -GCTGGAAGAAAAGATGAAGGCTATGGAGGACAAGATGAAATGG 

SMI 11 3 ' -CCTTCTTTTCTACTTCCGATACCTCCTGTTCTACTTTACC 



L E E K M K K (SEQ ID NO: 85) 

(amino acids 8-28) 
CTTGAGGAAAAGATGAAGAA-3 ' (SEQ ID NO: 83) 

GAACXCCTTTTCTACTTCTTCGA-5 " (SEQ ID NO: 84) 



SEG 4 LEEKMKAMEDKMKW 
SM112 5 ' -GCTCGAAGAAAGATGAAGGCAATGGAAGACAAAATGAAGTGG 

SMI 13 3 1 -GCTTCTTTCTACTTCCGTTACCTTCTGTTTTACTTCACC 

L E E K M K K (SEQ ID NO: 86) 

(amino acids 8-28) 
CTTGAGGAGAAAATGAAGAA-3' (SEQ ID NO: 87) 

GAACTCCTCTTTTACTTCTTCGA-5 1 (SEQ ID NO: 88) 



SEG 5 LKEEMAKMKDEMWK 
SMI 14 5 1 -GCTCAAGGAGGAAATGGCTAAGATGAAAGACGAAATGTGGAAA 

SMI 15 3 • -GTTCCTCCTTTACCGATTCTACTTTCTGCTTTACACCTTT 



L K E E M K K (SEQ ID NO: 89) 

(amino acids 8-28) 
CTGAAAGAGGAAATGAAGAA (SEQ ID NO: 90) 

GACTTTCTCCTTTACTTCTTCGA (SEQ ID NO: 91) 
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Clones were screened for the presence of the inserted segments by restriction 
digestion followed by separation of fragments on 6% acrylamide gels. Correct 
insertion of oligonucleotides was confirmed by DNA sequence analyses. Clones 
containing segments 3, 4 and 5 respectively were designated pSKseg3, pSKseg4, 
and pSKsegS. 

These "segment" clones were used in a duplication scheme as shown in 
Figure 14. Ten \ig of plasmid pSKseg3 were digested to completion with Nhe I 
and BspH I and the 1503 bp fragment isolated from an agarose gel using the 
Whatmann paper technique. Ten \xg of plasmid pSKseg4 were digested to 
completion with Nhe I and Nco I and the 2109 bp band gel isolated. Equal 
amounts of these fragments were ligated and recombinants selected on tetracycline. 
Clones were screened by restriction digestions and their sequences confirmed. The 
resultant plasmid was designated pSKseg34. 

pSKseg34 and pSKsegS plasmid DNAs were digested, fragments isolated 
and ligated in a similar manner as above to create a plasmid containing DNA 
sequences encoding segment 5 fused to segments 3 and 4. This construct was 
designated pSKseg534 and encodes the following amino acid sequence: 

SSP534 NH2 -MEEKMKKLKEEMAKMKDE1WKLKE 

MEDKMKWLEEKMKKLEEKMKVMEEKMKKL^ 
LEEKMKVMK-COOH <SEQ ID NO: 92) 

EXAMPLE 22 

Construction of SSP Chimeric Genes for Expression in the Seeds of Plants 
To express the synthetic gene products described in Example 21 in plant 
seeds, the sequences were transferred to the seed promoter vectors CW108, 
CW109 or ML113 (Figure 15>. The vectors CW108 and ML113 contain the bean 
phaseolin promoter (from base +1 to base -494),and 1191 bases of the 3' sequences 
from bean phaseolin gene. CW109 contains the soybean {3-conglycinin promoter 
(from base +1 to base -619) and the same 1191 bases of 3' sequences from the 
bean phaseolin gene. These vectors were designed to allow direct cloning of 
coding sequences into unique Nco I and Asp 718 sites. These vectors also provide 
sites (Hind IE or Sal I) at the 5' and 3' ends to allow transfer of the 
promoter/coding region/3 1 sequences directly to appropriate binary vectors. 

To insert the synthetic storage protein gene sequences, 10 jig of vector DNA 
were digested to completion with Asp 718 and Nco I restriction endonucleases. 
The linearized vector was purified via electrophoresis on a 1.0% agarose gel 
overnight electrophoresis at 15 volts. The fragment was collected by cutting the 
agarose in front of the band, inserting a 10 X 5 mm piece of Whatman 3MM paper 
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into the agarose and electrophoresing the fragment into the paper [Errington, 
(1990) Nucleic Acids Research, 18:17]. The fragment and buffer were spun out of 
the paper by centrifugation and the DNA in the -100 \iL was precipitated by 
adding 10 mg of tRNA, 10 y,L of 3 M sodium acetate and 200 jiL of ethanol. The 
precipitated DNA was washed twice with 70% ethanol and dried under vacuum. 
The fragment DNA was resuspended in 20 ftL of water and a portion diluted 
10-fold for use in ligation reactions. 

Plasmid DNA (10 mg) from clone 3-5 (carrying the SSP3-5 coding 
sequence) and pSK534 (carrying the SSP534 coding sequence) was digested to 
completion with Asp 718 and Nco I restriction endonucleases. The digestion 
products were separated on an 18% polyacrylamide non-denaturing gel. Gel slices 
containing the desired fragments were cut from the gel and purified by inserting the 
gel slices into a 1% agarose gel and electrophoresing for 20 min at 100 volts. 
DNA fragments were collected on 10 X 5 mm pieces of Whatman 3MM paper, the 
buffer and fragments spun out by centrifugation and the DNA precipitated with 
ethanol. The fragments were resuspended in 6 jlL water. One microliter of the 
diluted vector fragment described above, 2 \xL of 5X ligation buffer and 1 ^tL of 
T4 DNA ligase were added. The mixture was ligated overnight at 15°- 

The ligation mixes were transformed into E. coli strain DH5a [supE44 del 
lacU169 (phi 80 lacZ del M15) hsdR17 recAl endAl gyrl96 thil relAl] and 
ampicillin-resistant colonies selected. The clones were screened by restriction 
endonuclease digestion analyses of rapid plasmid DNAs and by DNA sequencing. 

EXAMPLE 23 
Tobacco Plants Containing the Chimeric Genes Phaseolin 
PrQmoter/cts fl y$C-M4 and ft-conglycinin promQter/SSP3-5 

The binary vector pZS97 was used to transfer the chimeric SSP3-5 gene of 
Example 22 and the chimeric £. coli dapA and lysC-M4 genes of Example 4 to 
tobacco plants. Binary vector pZS97 (Figure 6) is part of a binary Ti plasmid 
vector system [Bevan, (1984) NucL Acids. Res. /2:871 1-8720] of Agrobacterium 
tumefaciens. The vector contains: (1) the chimeric gene nopaline 
synthase: :neomycin phosphotransferase (nos::NPTII) as a selectable marker for 
transformed plant cells [Bevan et al., (1983) Nature 304:184-186], (2) the left and 
right borders of the T-DNA of the Ti plasmid [Bevan, (1984) NucL Acids. Res. 
12:8711-8720], (3) the E. coli lacZ a-complementing segment [Viering et al., 
(1982) Gene 79:259-267] with a unique Sal I site(pSK97K) or unique Hind m site 
(pZS97) in the polylinker region, (4) the bacterial replication origin from the 
Pseudomonas plasmid pVSl [Itoh et al., (1984) Plasmid ii:206-220], and (5) the 
bacterial ^-lactamase gene as a selectable marker for transformed A. tumefaciens. 
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Plasmid pZS97 DNA was digested to completion with Hind EQ enzyme and 
the digested plasmid was gel purified. The Hind HI digested pZS97 DNA was 
mixed with the Hind ED digested and gel isolated chimeric SSP3-5 gene of 
Example 22, ligated, transformed and colonies selected on ampicillin. 

The binary vector containing the chimeric gene was transferred by tri- 
parental mating [Ruvkin et al., (1981) Nature 259:85-88] to Agrobacterium strain 
LBA4404/fcAL4404 [Hockema et al., (1983), Nature 50J:179-18O] selecting for 
carbenicillin resistance. Cultures of Agrobacterium containing the binary vector 
was used to transform tobacco leaf disks [Horsch et al., (1985) Science 
227: 1229-1231]. Transgenic plants were regenerated in selective medium 
containing kanamycin. 

Transformed tobacco plants containing the chimeric gene, p-conglycinin 
promoter/SSP3-5^phaseolin 3* region, were thus obtained. Two transformed lines, 
pSK44-3A and pSK44-9A, which carried a single site insertion of the SSP3-5 gene 
were identified based upon 3:1 segregation of the marker gene for kanamycin 
resistance. Progeny of the primary transform ants, which were homozygous for the 
transgene, pSK44-3A-6 and pSK44-9A-5, were then identified based upon 4:0 
segregation of the kanamycin resistance in seeds of these plants. 

Similarly, transformed tobacco plants with the chimeric genes phaseolin 5' 
region/cts /lysC -M4/phaseolin 3 f region and phaseolin 5 f 

region/cts/eco dapA/ phaseolin 3' region were obtained as described in Example 12. 
A transformed line, BT570-45A, which carried a single site insertion of the 
DHDPS and AK genes was identified based upon 3: 1 segregation of the marker 
gene for kanamycin resistance. Progeny from the primary transform ant which were 
homozygous for the transgene, BT570-45A-3 and BT570-45A-4, were then 
identified based upon 4:0 segregation of the kanamycin resistance in seeds of these 
plants. 

To generate plants carrying all three chimeric genes genetic crosses were 
performed using the homozygous parents. Plants were grown to maturity in 
greenhouse conditions. Flowers to be used as male and female were selected one 
day before opening and older flowers on the inflorescence removed. For crossing, 
female flowers were chosen at the point just before opening when the anthers were 
not dehiscent. The corolla was opened on one side and the anthers removed. Male 
flowers were chosen as flowers which had opened on the same day and had 
dehiscent anthers shedding mature pollen. The anthers were removed and used to 
pollinate the pistils of the anther-stripped female flowers. The pistils were then 
covered with plastic tubing to prevent further pollination. The seed pods were 
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allowed to develop and dry for 4-6 weeks and harvested. Two to three separate 
pods were recovered from each cross. The following crosses were performed: 



Male X Female 

5 BT570-45A-3 pSK44-3A-6 

BT570r45A-4 pSK44-3A-6 
pSK44-3A-6 BT570-45A-4 
BT570-45A-5 pSK44-9A-5 
pSk44-9A-5 BT570-45A-5 
10 Dried seed pods were broken open and seeds collected and pooled from each 

cross. Thirty seeds were counted out for each cross and for controls seeds from 
selfed flowers of each parent were used. Duplicate seed samples were hydrolyzed 

H; and assayed for total amino acid content as described in Example 8. The amount 

£3 

p of increase in lysine as a percent of total seeds amino acids over wild type seeds, 

fy 15 which contain 2.56% lysine, is ^presented in Table 16 along the copy number of 
p each gene in the endosperm of the seed. 

m 

Jf J TABLE 17 

y* copy number 



male 


X 


female 


AK&DHDPS 
genes 


copy number 
SSP gene 


lysine 
increase 


BT570-45A 


X 


BT570-45A 


1* 


0 


0 


pSK44-9A 


X 


pSK44-9A 


0 


1* 


0.12 


pSK44-9A-5 


X 


pSK44-9A-5 


0 


2 


0.29 


DSK44-9A-5 


X 


BT570-45A-5 


1 


1 


0.6 


BT570-45A-5 


X 


pSK44-9A-5 


1 


1 


0.29 


pSK44-3A 


X 


pSK44-3A 


0 


1* 


0.28 


pSK44-3A-6 


X 


pSK44-3A-6 


0 


2 


0.5 


pSK44-3A-6 


X 


BT57(M5A-4 


1 


1 


0.62 


BT570-45A-3 


X 


pSK44-3A-6 


1 


1 


0.27 


BT570-45A-4 


X 


pSK44-3A-6 


1 


1 


0.29 



* copy number is average in population of seeds 



The results of these crosses demonstrate that the total lysine levels in seeds 
20 can be increased by the coordinate expression of the lysine biosynthesis genes and 
the high lysine protein SSP3-5. In seeds derived from hybrid tobacco plants, this 
synergism is strongest when the biosynthesis genes are derived from the female 
parent It is expected that the lysine level would be further increased if the 
biosynthesis genes and the lysine-rich protein genes were all homozygous. 
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EXAMPLE 24 

Soybean Plants Containing the Chimeric Genes Phaseolin Promoter/cts/cordapA. 
Phaseolin Promoter/cts/lvsC-M4 and Phaseolin Promoter/SSP3-5 
Transformed soybean plants that express the chimeric gene, phaseolin 
5 promoter/cts/cordapA/ phaseolin 3' region and phaseolin promoter/cts/lyg£-M4/ 
phaseolin 3' region have been described in Example 19. Transformed soybean 
plants that express the chimeric gene, phaseolin promoter/SSP3-5^phaseolin 3' 
region, were obtained by inserting the chimeric gene as an isolated Hind HI 
fragment into an equivalent soybean transformation vector plasmid pML63 
10 (Figure 16) and carrying out transformation as described in Example 19. 

Seeds from primary transform ants were sampled by cutting small chips from 
the sides of the seeds away from the embryonic axis. The chips were assayed for 
GUS activity as described in Example 19 to determine which of the segregating 
seeds carried the transgenes. Half seeds were ground to meal and assayed for 
15 expression of SSP3-5 protein by Enzyme Linked Immunosorbent Assay (ELISA). 
Elisa was performed as follows: 
01 A fusion protein of glutathione-S -transferase and the SSP3-5 gene product 

y § was generated through the use of the Pharmacia^ pGEX GST Gene Fusion System 

Li: (Current Protocols in Molecular Biology, Vol. 2, pp 16.7. 1-8, (1989) John Wiley 

Hi 20 and Sons). The fusion protein was purified by affinity chromatography on 
vj glutathione agarose (Sigma) or glutathione sepharose (Pharmacia) beads, 

D concentrated using Centricon 10_ (Amicon) filters, and then subjected to SDS 

polyacrylamide electrophoresis (15% Acrylamide, 19:1 Acrylamide:Bisaciylamide) 
for further purification. The gel was stained with Coomassie Blue for 30 min, 
25 destained in 50% Methanol, 10% Acetic Acid and the protein bands electroeluted 
using an Amicon_ Centiluter Microelectroeluter (Paul T* Matsudaira ed., A 
Practical Guide to Protein and Peptide Purification for Microsequencing, 
Academic Press, Inc. New York, 1989). A second gel prepared and run in the 
same manner was stained in a non acetic acid containing stain [9 parts 0.1% 
30 Coomassie Blue G250 (Bio-Rad) in 50% methanol and 1 part Serva Blue (Serva, 
Westbury, NY) in distilled water] for 1-2 h. The gel was briefly destained in 20% 
methanol, 3% glycerol for 0.5-1 h until the GST-SSP3-5 band was just barely 
visible. This band was excised from the gel and sent with the electroeluted material 
to Hazelton Laboratories for use as an antigen in immunizing a New Zealand 
35 Rabbit. A total of 1 mg of antigen was used (0.8 mg in gel, 0.2 mg in solution). 
Test bleeds were provided by Hazelton Laboratories every three weeks. The 
approximate titer was tested by western blotting ofE. coli extracts from cells 
containing the SSP-3-5 gene under the control of the T7 promoter at different 
dilutions of protein and of serum. 
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IgG was isolated from the serum using a Protein A sepharose column. The 
IgG was coated onto microtiter plates at 5 \ig per well. A separate portion of the 
IgG was biotinylated. 

Aqueous extracts from transgenic plants were diluted and loaded into the 
wells usually starting with a sample containing 1 |Xg of total protein. The sample 
was diluted several more times to insure that at least one of the dilutions gave a 
result that was within the range of a standard curve generated on the same plate. 
The standard curve was generated using chemically synthesized SSP3-5 protein. 
The samples were incubated for 1 h at 37° and the plates washed. The biotinylated 
IgG was then added to the wells. The plate was incubated at 37° for 1 h and 
washed. Alkaline phosphatase conjugated to streptavidin was added to the wells, 
incubated at 37° for 1 h and washed. A substrate consisting of 1 mg/mL p 
nitrophenylphosphate in 1 M diethanolamine was added to the wells and the plates 
incubated at 37° for 1 h. A 5% EDTA stop solution was added to the wells and 
the absorbance read at 405 ran minus 650 nm reading. Transgenic soybean seeds 
contained 0.5 to 2.0% of water extractable protein as SSP3-5. 

The remaining half seeds positive for GUS and SSP3-5 protein were planted 
and grown to maturity in greenhouse conditions. To determine homozygotes for 
the GUS phenotype, seed from these Rl plants were screened for segregation of 
GUS activity as above. Plants homozygous for the phaseolin/SSP3-5 gene are then 
crossed with homozygous transgenic soybeans expressing the Corynebacterium 
dapA gene product or expressing the Corynebacterium dapA gene product plus 
the E. coli lysC-M4 gene product. 

As an preferred alternative to bringing the chimeric SSP gene and chimeric 
cordagA gene plus the E. coli lysC -M4 gene together via genetic crossing, a single 
soybean tranformation vector carrying all the genes can be constructed from the 
gene fragments described above and transformed into soybean as described in 
Example 19. 

EXAMPLE 25 

Construction of Chimeric Genes for Ex pression o f Corynebacterium DHDPS. lvs r z 
Corn DHDPS. £. m li AKTTT-M 4 and SSP3-5 proteins in the Embryo and 

Endosperm of Transformed Com 
The following chimeric genes were made for transformation into com: 
globulin 1 promoter/mcts/ly£-M4/NOS 3' region 
globulin 1 promoter/mcts/cordapA/NOS 3 region 
glutelin 2 promoter/mcts/lysC-M4/NOS 3' region 
glutelin 2 promoterAncts/cor^apA/NOS 3' region 
globulin 1 promoter/SSP3-5/globulin 1 3' region 
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glutelin 2 promoter/SSP3-5/10 kD 3' region 

globulin 1 promoter/com lys r -mutant DHDPS gene/globulin 1 3* region 
glutelin 2 promoter/com lys r -mutant DHDPS gene/10 kD 3' region 
The glutelin 2 promoter was cloned from com genomic DNA using PCR 
5 with primers based on the published sequence [Reina et al. ( 1990) Nucleic Acids 
Res. /S:6426-6426]. The promoter fragment includes 1020 nucleotides upstream 
from the ATG translation start codon. An Nco I site was introduced via PCR at 
the ATG start site to allow for direct translational fusions. A BamH I site was 
introduced on the 5' end of the promoter. The 1 .02 kb BamH I to Nco I promoter 
10 fragment was cloned into the BamH I to Nco I sites of the plant expression vector 
pML63 (see Example 24) replacing the 35S promoter to create vector pML90. 
This vector contains the glutelin 2 promoter linked to the GUS coding region and 
5 g the NOS 3'. 

□ The 10 kD zein 3' region was derived from a 10 kD zein gene clone 

PH 1 5 generated by PCR from genomic DNA using oligonucleotide primers based on the 

□ published sequence [Kirihara et al. (1988) Gene 71 :359-370]. The 3' region 

CP extends 940 nucleotides from the stop codon. Restriction endonuclease sites for 

Kpn I, Sma I and Xba I sites were added immediately following the TAG stop 
H- codon by oligonucleotide insertion to facilitate cloning. A Sma I to Hind m 

ff 20 segment containing the 10 kD 3'region was isolated and ligated into Sma I and 
H Hind m digested pML90 to replace the NOS 3' sequence with the 10 kD 3'region, 

° thus creating plasmid pML103. pML103 contains the glutelin 2 promoter, an 

Nco I site at the ATG start codon of the GUS gene, Sma I and Xba I sites after the 
stop codon, and 940 nucleotides of the 10 kD zein 3' sequence. 
25 The globulin 1 promoter and 3' sequences were isolated from a Clontech 

com genomic DNA library using oligonucleotide probes based on the published 
sequence of the globulin 1 gene [Kriz et al. (1989) Plant Physiol. 92:636]. The 
cloned segment includes the promoter fragment extending 1078 nucleotides 
upstream from the ATG translation start codon, the entire globulin coding 
30 sequence including introns and the 3* sequence extending 803 bases from the 

translational stop. To allow replacement of the globulin 1 coding sequence with 
other coding sequences an Nco I site was introduced at the ATG start codon, and 
Kpn I and Xba I sites were introduced following the translational stop codon via 
PCR to create vector pCC50. There is a second Nco I site within the globulin 1 
35 promoter fragment. The globulin 1 gene cassette is flanked by Hind m sites. 

The plant amino acid biosynthetic enzymes are known to be localized in the 
chloroplasts and therefore are synthesized with a chloroplast targeting signal. 
Bacterial proteins such as DHDPS and AKffl have no such signal. A chloroplast 
transit sequence (cts) was therefore fused to the cordapA and lysC_-M4 coding 
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sequence in the chimeric genes described below. For corn the cts used was based 
on the the cts of the small subunit of ribulose 1,5-bisphosphate carboxylase from 
corn [Lebrun et al. (1987) Nucleic Acids Res. i5:4360] and is designated mcts to 
distinguish it from the soybean cts. The oligonucleotides SEQ ID NOS:94-99 
were synthesized and used as described in Example 6. 

To construct the chimeric gene: 
globulin 1 promoterAncts/ly§Q-M4/NOS 3' region 

an Nco I to Hpa I fragment containing the mcts /lysC -M4 coding sequence was 
isolated from plasmid pBT558 (see Example 6) and inserted into Nco I plus Sma I 
digested pCC50 creating plasmid pBT663. 

To construct the chimeric gene: 
globulin 1 promoter/mcts/cordapA/NOS 3 region 

an Nco I to Kpn I fragment containing the mcts/eco dapA coding sequence was 
isolated from plasmid pBT576 (see Example 6) and inserted into Nco I plus Kpn I 
digested pCC50 creating plasmid pBT662. Then the ec odapA coding sequence 
was replaced with the cordapA coding sequence as follows. An Afl II to Kpn I 
fragment containing the distal two thirds of the mcts fused to the cordapA coding 
sequence was inserted into Afl II to Kpn I digested pBT662 creating plasmid 
pBT677. 

To construct the chimeric gene: 
glutelin 2 promoter/mcts/lx^-M4/NOS 3* region 

an Nco I to Hpa I fragment containing the mcts/lvsC-M4 coding sequence was 
isolated from plasmid pBT558 (see Example 6) and inserted into Nco I plus Sma I 
digested pML90 creating plasmid pBT580. 

To construct the chimeric gene: 
glutelin 2 promoter/mcts/co rdapA/ NOS 3' region 

an Nco I to Kpn I fragment containing the mcts/cordapA coding sequence was 
isolated from plasmid pBT677 and inserted into Nco I to Kpn I digested pML90, 
creating plasmid pBT679. 

The chimeric genes: 
globulin 1 promoter/mcts /lysC -M4/NOS 3' region and 
globulin 1 promoter/mcts/co rdapA/ NOS 3 region 

were linked on one plasmid as follows. pBT677 was partially digested with 
Hind Ht and full-length linearized plasmid DNA was isolated. A Hind m fragment 
carrying the globulin 1 promoter/mcts/ly£-M4/NOS 3' region was isolated from 
pBT663 and ligated to the linearized pBT677 plasmid creating pBT680 
(Figure 17). 
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The chimeric genes: 
glutelin 2 promoter/mcts/lysC-M4/NOS 3' region and 
glutelin 2 promoter/mcts/cordagA/NOS 3' region 

were linked on one plasmid as follows. pBT580 was partially digested with Sal I 
and full-length linearized plasmid DNA was isolated. A Sal I fragment carrying the 
glutelin 2 promoter/mcts/co rdap A/ N OS 3' region was 
isolated from pBT679 and ligated to the linearized pBT580 plasmid creating 
pBT681 (Figure 18). 

To construct the chimeric gene: 
glutelin 2 promoter/SSP3-5/10 kD 3' region 

the plasmid pML103 (above) containing the glutelin 2 promoter and 10 kD zein 3' 
region was cleaved at the Nco I and Sma I sites. The SSP3-5 coding region 
(Example 22) was isolated as an Nco I to blunt end fragment by cleaving with 
Xba I followed by filling in the sticky end using Klenow fragment of DNA 
polymerase, then cleaving with Nco I. The 193 base pair Nco I to blunt end 
fragment was ligated into the Nco I and Sma I cut pML103 to create pLH104 
(Figure 19). 

To construct the chimeric gene: 
globulin 1 promoter/SSP3-5/globulin 1 3'region 

the 193 base pair Nco I and Xba I fragment containing the SSP3-5 coding region 
(Example 22) was inserted into plasmid pCC50 (above) between the globulin 1 5* 
and y regions creating pLH105 (Figure 20). 

The com DHDPS cDNA gene was cloned and sequenced previously [Frisch 
et al. (1991 ) Mol Gen Genet 225:287-293]. A mutation that rendered the protein 
insensitive to feedback inhibition by lysine was introduced into the gene. This 
mutation is a single nucleotide change that results in a single amino acid 
substitution in the protein; alal66 is changed to val. The lys r com DHDPS gene 
was obtained from Dr. Burle Gengenbach at the University of Minnesota. An 
Nco I site was introduced at the translation start codon of the gene and a Kpn I site 
was introduced immediately following the translation stop codon of the gene via 
PCR using the following primers: 

SEQ ID NO: 106: 5'-ATTCCCCATG GTTTCGCCGA CGAAT 
SEQ ID NO: 107: 5 -CTCTCGGTAC CTAGTACCTA CTGATCAAC 
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To construct the chimeric gene: 
globulin 1 promoter/lys r com DHDPS gene/globulin 1 3'region the 1144 base pair 
Nco I and Kpn I fragment containing the lys r com DHDPS gene was inserted into 
plasmid pCC50 (above) between the globulin 1 5' and 3* regions creating pBT739 
(Figure 21). 

To construct the chimeric gene: 
glutelin 2 promoter/lys r com DHDPS gene/10 kD 3* region 
the 1 144 base pair Nco I and Kpn I fragment containing the lys r com DHDPS gene 
was inserted into a plasmid containing the glutelin 2 promoter and 10 kD zein 3' 
region creating plasmid pBT756 (Figure 22). 

Com transformations were done as described in Examples 17 and 18 with the 
following exceptions: 

1) Embryogenic cell culture development was as described in Example 17 
except the exact culture used for bombardment was designated LH132.5.X, or 
LH132.6.X. 

2) The selectable marker used for these experiments was either the 35S/bar 
gene from pDETRIC as described in Example 18 or 35S/Ac, a synthetic 
phosphinothricin-N-acetyltransferase (pat) gene under the control of the 35S 
promoter and 3' terminator/ polyadenylation signal from Cauliflower Mosiac Virus 
[Eckes et.al., (1989) / Cell Biochem Suppl 13 D] 

3) The bombardment parameters were as described for Example 17 and 18 
except that the bombardments were performed as ,, tribombardments ,, by co- 
precipitating 1.5 pg of each of the DNAs (35S/bar or 35S/Ac, pBT681 and 
pLH104 or 35S/Ac, pbt680 and pLH105) onto the gold particles. 

4) Selection of transgenic cell lines was as described for glufosinate selection 
as in Example 18 except that the tissue was placed on the selection media within 
24 h after bombardment. 

EXAMPLE 26 

Com Plants Containing Chimeric Genes for Expression of Corynebacterium 
DHDPS and F mliAKJUMA or ly ^-Com DHDPS in the Embrvo and Endosperm 
Com was transformed as described in Example 25 with the chimeric genes: 

• globulin 1 promoter/focts/cordapA/NOS 3 region along with or without 

globulin 1 promoter/mcts /lysC -M4/NOS 3' region; or 

• glutelin 2 promoter/mcts/co rdap A7 N OS 3' region along with or without 

glutelin 2 promoter/mcts/lysQ-M4/NOS 3' region. 
Plants regenerated from transformed callus were analyzed for the presence of 
the intact transgenes via Southern blot or PCR. The plants were either selfed or 
outcrossed to an elite line to generate Fl seeds. Six to eight seeds were pooled 
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and assayed for expression of the Corynebacterium DHDPS protein and the E. coli 
AKHI-M4 protein by western blot analysis. The free amino acid composition and 
total amino acid composition of the seeds were determined as described in previous 
examples. 

Expression of the Corynebacterium DHDPS protein, driven by either the 
globulin 1 or glutelin 2 promoter, was observed in the com seeds (Table 12). 
Expression of the E. coli AKIEI-M4 protein, driven by the glutelin promoter was 
also observed in the com seeds. Free lysine levels in the seeds increased from 
about 1.4% of free amino acids in control seeds to 15-27% in seeds of three 
different transfonnants expressing Corynebacterium DHDPS from the globulin 1 
promoter. TTie increased free lysine, and a high level of saccharopine, indicative of 
lysine catabolism, were both localized to the embryo in seeds expressing 
Corynebacterium DHDPS from the globulin 1 promoter. No increase in free lysine 
was observed in seeds expressing Corynebacterium DHDPS from the glutelin 2 
promoter with or without E. coli AKHI-M4. Lysine catabolism is expected to be 
much greater in the endosperm than the embryo and this probably prevents the 
accumulation of increased levels of lysine in seeds expressing Corynebacterium 
DHDPS plus £. coli AKIH-M4 from the glutelin 2 promoter. 

Lysine normally represents about 2.3% of the seed amino acid content. It is 
therefore apparent from Table 12 that a 130% increase in lysine as a percent of 
total seed amino acids was found in seeds expressing Corynebacterium DHDPS 
from the globulin 1 promoter. 



TRANSGENIC 
LINE 



T AB LE 12 

WESTERN WESTERN 
C QRYNR ECQLl 



PROMOTER DHDPS 



AKIH-M4 



%LYSOF 
FREE SEED 
AMINO ACIDS 



%LYSOF 
TOTAL SEED 
AMINO ACIDS 



1088.1.2 x elite globulin 1 
1089.4.2 x elite globulin 1 
1099.2.1 x self globulin 1 



+ 
+ 



15 
21 
27 



3.6 
5.1 
5.3 



1090.2. 1 x elite glutelin 2 
1092.2. 1 x elite glutelin 2 



+ 
+ 



1.2 
1.1 



1.7 
2.2 
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